Publications

Detailed Information

KOREAN DIALECT IDENTIFICATION BASED ON INTONATION MODELING

Cited 2 time in Web of Science Cited 3 time in Scopus
Authors

Lee, Jooyoung; Kim, Kyungwha; Chung, Minhwa

Issue Date
2021-11
Publisher
IEEE
Citation
2021 24TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), pp.168-173
Abstract
Korean dialect identification (K-DID) is a challenging task due to its relatively unexplored field of study, mutual comprehensibility between the dialects, and lack of sufficient Korean dialect datasets available in the past. With large-scaled dialect datasets now available, this paper proposes intonational modeling of the Korean dialects by feeding frame-wise acoustic features on sequential modeling of a neural network. Compared to previous prosodic labeling with syllable-based pitch marking, our approach of intonation modeling is realized with the combination of a set of spectral features, including fundamental frequency, trained on a bidirectional LSTM network with attention mechanism. We believe the attention mechanism enables the detection of dialect-rich segments hidden among the dominant non-dialect segments within the same utterance. We test the networks on different combinations of speaker ages and speech styles. The best performance of the K-DID is achieved with 68.51% in utterance-level accuracy, which surpasses our previous work.
URI
https://hdl.handle.net/10371/187154
DOI
https://doi.org/10.1109/O-COCOSDA202152914.2021.9660537
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share