Publications

Detailed Information

Counterfactual Generative Smoothing for Imbalanced Natural Language Classification

Cited 0 time in Web of Science Cited 3 time in Scopus
Authors

Han, Hojae; Choi, Seungtaek; Jeong, Myeongho; Park, Jin-Woo; Seung-Won, Hwang

Issue Date
2021-10
Publisher
Association for Computing Machinery
Citation
International Conference on Information and Knowledge Management, Proceedings, pp.3058-3062
Abstract
© 2021 ACM.Classification datasets are often biased in observations, leaving onlya few observations for minority classes. Our key contribution is de-tecting and reducing Under-represented (U-) and Over-represented(O-) artifacts from dataset imbalance, by proposing a Counterfac-tual Generative Smoothing approach on both feature-space anddata-space, namely CGS_f and CGS_d. Our technical contribution issmoothing majority and minority observations, by sampling a ma-jority seed and transferring to minority. Our proposed approachesnot only outperform state-of-the-arts in both synthetic and real-lifedatasets, they effectively reduce both artifact types.
URI
https://hdl.handle.net/10371/184199
DOI
https://doi.org/10.1145/3459637.3482077
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share