KOREAN DIALECT IDENTIFICATION BASED ON INTONATION MODELING

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

KOREAN DIALECT IDENTIFICATION BASED ON INTONATION MODELING

Cited 2 time in Web of Science Cited 3 time in Scopus

Citation: 2021 24TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), pp.168-173

Abstract: Korean dialect identification (K-DID) is a challenging task due to its relatively unexplored field of study, mutual comprehensibility between the dialects, and lack of sufficient Korean dialect datasets available in the past. With large-scaled dialect datasets now available, this paper proposes intonational modeling of the Korean dialects by feeding frame-wise acoustic features on sequential modeling of a neural network. Compared to previous prosodic labeling with syllable-based pitch marking, our approach of intonation modeling is realized with the combination of a set of spectral features, including fundamental frequency, trained on a bidirectional LSTM network with attention mechanism. We believe the attention mechanism enables the detection of dialect-rich segments hidden among the dominant non-dialect segments within the same utterance. We test the networks on different combinations of speaker ages and speech styles. The best performance of the K-DID is achieved with 68.51% in utterance-level accuracy, which surpasses our previous work.

Appears in Collections:

Show Full Item Record

Find it @ SNU

SNS Share