Document representation based on probabilistic word clustering in customer-voice classification

Cited 5 times in Web of Science; cited 5 times in Scopus
Authors

Lee, Younghoon; Song, Seokmin; Cho, Sungzoon; Choi, Jinhae

Issue Date
2019-02
Publisher
Springer Verlag
Citation
Pattern Analysis and Applications, Vol.22 No.1, pp.221-232
Abstract
Customer-voice data play an important role in fields such as marketing, product planning, and quality assurance. However, classifying these data is problematic because it relies on manual processes. This study focuses on building automatic classifiers for customer-voice data using newly proposed document representation methods based on neural embedding and probabilistic word clustering. Semantically similar terms are grouped into a common cluster: the word vectors produced by neural embedding are clustered according to the membership strength of each word in each cluster, derived from a probabilistic clustering method such as fuzzy C-means or a Gaussian mixture model. Because it takes membership strength into account, the proposed method is expected to suit the classification of customer-voice data consisting of unstructured text. The results demonstrate that the proposed method achieved an accuracy of 89.24% with respect to representational effectiveness and an accuracy of 87.76% with respect to the classification performance of customer-voice data consisting of 12 classes. Further, the method provided an intuitive interpretation of the generated representation.
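The general idea in the abstract (soft cluster memberships of word vectors used as document features) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy two-dimensional embeddings, the function names, the deterministic initial centers, and the choice to average word memberships into a document vector are all assumptions made for illustration, since the abstract does not specify these details. Fuzzy C-means is written out in plain Python so the example has no external dependencies.

```python
import math

def fuzzy_memberships(point, centers, m=2.0):
    """Fuzzy C-means membership of one point in each cluster (values sum to 1)."""
    dists = [math.dist(point, c) for c in centers]
    # A point lying exactly on a center belongs fully to that cluster.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_k / d_j) ** power for d_j in dists) for d_k in dists]

def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Alternate membership and center updates, starting from the given centers."""
    dim = len(points[0])
    for _ in range(iterations):
        u = [fuzzy_memberships(p, centers, m) for p in points]
        new_centers = []
        for k in range(len(centers)):
            weights = [row[k] ** m for row in u]
            total = sum(weights)
            new_centers.append([
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dim)
            ])
        centers = new_centers
    return centers

def represent(document, embeddings, centers, m=2.0):
    """Assumed aggregation: average the membership vectors of in-vocabulary words."""
    rows = [fuzzy_memberships(embeddings[w], centers, m)
            for w in document if w in embeddings]
    if not rows:
        return [0.0] * len(centers)
    return [sum(col) / len(rows) for col in zip(*rows)]

# Hypothetical 2-D "embeddings"; real neural embeddings have many more dimensions.
embeddings = {
    "battery": [0.9, 0.1], "charger": [0.8, 0.2], "power": [1.0, 0.0],
    "screen": [0.1, 0.9], "display": [0.2, 0.8], "pixel": [0.0, 1.0],
}
points = list(embeddings.values())
# Deterministic initialization (an assumption; random initialization is more common).
centers = fuzzy_c_means(points, [embeddings["battery"], embeddings["screen"]])

# "slow" has no embedding here and is skipped.
doc_vector = represent(["battery", "charger", "slow"], embeddings, centers)
print(doc_vector)
```

Each component of `doc_vector` is the average strength with which the document's words belong to one cluster, which is what makes such a representation readable: a large first component here means the document is mostly about the power-related word group. The resulting vectors could then be passed to any standard classifier.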
ISSN
1433-7541
URI
https://hdl.handle.net/10371/195522
DOI
https://doi.org/10.1007/s10044-018-00772-1
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share