Publications

Detailed Information

User Guide for KOTE: Korean Online That-gul Emotions Dataset : User Guide for KOTE: Korean Online Comments Emotions Dataset

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

Jeon, Duyoung; Lee, Junho; Kim, Cheongtag

Issue Date
2024-05
Publisher
European Language Resources Association (ELRA)
Citation
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, pp.17254-17270
Abstract
Despite the lack of comprehensive exploration of emotional connotations, sentiment analysis, which categorizes data as positive or negative, has been widely employed to identify emotional aspects in texts. Recently, corpora labeled with more than just valence or polarity have been built to surpass this limitation. However, most Korean emotion corpora are limited by their small size and narrow range of emotions covered. In this paper, we introduce the KOTE dataset. The KOTE dataset comprises 50,000 Korean online comments, totaling 250,000 cases, each manually labeled for 43 emotions and NO EMOTION through crowdsourcing. The taxonomy for the 43 emotions was systematically derived through cluster analysis of Korean emotion concepts within the word embedding space. After detailing the development of KOTE, we further discuss the results of fine-tuning, as well as analysis for social discrimination within the corpus.
URI
https://hdl.handle.net/10371/204794
DOI
https://doi.org/10.48550/arXiv.2205.05300
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share