Publications

Detailed Information

Self-imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training

Cited 5 time in Web of Science Cited 6 time in Scopus
Authors

Yang, Seung Hee; Chung, Minhwa

Issue Date
2019-09
Publisher
ISCA-INT SPEECH COMMUNICATION ASSOC
Citation
INTERSPEECH 2019, pp.1881-1885
Abstract
Self-imitating feedback is an effective and learner-friendly method for non-native learners in Computer-Assisted Pronunciation Training. Acoustic characteristics in native utterances are extracted and transplanted onto learner's own speech input, and given back to the learner as a corrective feedback. Previous works focused on speech conversion using prosodic transplantation techniques based on PSOLA algorithm. Motivated by the visual differences found in spectrograms of native and non-native speeches, we investigated applying GAN to generate self-imitating feedback by utilizing generator's ability through adversarial training. Because this mapping is highly under-constrained, we also adopt cycle consistency loss to encourage the output to preserve the global structure, which is shared by native and non-native utterances. Trained on 97,200 spectrogram images of short utterances produced by native and non-native speakers of Korean, the generator is able to successfully transform the non-native spectrogram input to a spectrogram with properties of self-imitating feedback. Furthermore, the transformed spectrogram shows segmental corrections that cannot be obtained by prosodic transplantation. Perceptual test comparing the self-imitating and correcting abilities of our method with the baseline PSOLA method shows that the generative approach with cycle consistency loss is promising.
ISSN
2308-457X
URI
https://hdl.handle.net/10371/186392
DOI
https://doi.org/10.21437/Interspeech.2019-1478
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share