A Corpus-Based Study on Segment Variations in English Speech Produced by Korean Learners
한국인 학습자의 영어 분절음 변이에 관한 코퍼스 기반 연구

DC Field: Value
dc.contributor.advisor: 정민화
dc.contributor.author: 홍혜진
dc.date.accessioned: 2017-07-14T01:03:35Z
dc.date.available: 2017-07-14T01:03:35Z
dc.date.issued: 2015-02
dc.identifier.other: 000000025652
dc.identifier.uri: https://hdl.handle.net/10371/121619
dc.description: Thesis (Ph.D.) -- Seoul National University Graduate School: Department of Linguistics, 2015. 2. 정민화.
dc.description.abstract: This study investigates segment variations in a large corpus of English speech produced by Korean learners and examines how linguistic and extra-linguistic factors affect those variations.
A variation matrix of the learners' speech production was generated from manual transcriptions of their speech corpus, and noticeable variations were identified from that matrix. A variation matrix of native speech was generated from the TIMIT corpus, which exhibits various dialectal variations. The most noticeable variations in the learners' speech were then determined by a method that compares the variation matrices of learners and native speakers. These comprised five vocalic variations (insertion of /ɯ/, substitution of /oʊ/ for /ɔ/, substitution of /oʊ/ for /ɑ/, substitution of /ɑ/ for /ʌ/, and substitution of /æ/ for /ʌ/) and four consonantal variations (substitution of /s/ for /z/, substitution of /d/ for /ð/, substitution of /b/ for /v/, and substitution of /p/ for /f/). The results demonstrated that the learners exhibited segment variations that differed from those of native speakers. The learners' vocalic variations were greatly affected by orthography, while effects of the native language were observed in their consonantal variations.
To investigate the linguistic and extra-linguistic factors constraining the learners' segment variations, the effects of these factors on the most noticeable variations, in the learners' speech only, were analyzed using generalized linear mixed models. The results indicated that the most noticeable variations were affected by both linguistic and extra-linguistic factors. Segmental context greatly affected the most noticeable variations, and an effect of orthography was found in the most noticeable vocalic variations. Among the extra-linguistic factors, both learner gender and speech rate affected the most noticeable variations.
This dissertation describes the first quantitative study of English segment variations produced by Korean learners using a large-scale speech corpus. In addition, the study contributes to understanding the general effects of linguistic and extra-linguistic factors on learners' variations. The results can be used in both context-independent and context-dependent modeling of segment variations to improve the performance of non-native speech recognition systems.
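Illustrative note: the variation-matrix comparison described in the abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the dissertation's actual procedure: the input is assumed to be already-aligned (canonical, realized) phone pairs, the function names `variation_matrix` and `noticeable_variations` are hypothetical, the thresholds `min_rate` and `min_gap` are invented for illustration, and the toy data below is made up rather than taken from the corpus.

```python
from collections import Counter, defaultdict


def variation_matrix(aligned_pairs):
    """Return row-normalized rates: rates[canonical][realized] = proportion.

    aligned_pairs is an iterable of (canonical, realized) phone labels.
    Alignment itself is assumed to have been done beforehand.
    """
    counts = defaultdict(Counter)
    for canonical, realized in aligned_pairs:
        counts[canonical][realized] += 1
    rates = {}
    for canonical, row in counts.items():
        total = sum(row.values())
        rates[canonical] = {realized: n / total for realized, n in row.items()}
    return rates


def noticeable_variations(learner, native, min_rate=0.1, min_gap=0.05):
    """List variations that learners produce clearly more often than natives.

    min_rate and min_gap are hypothetical thresholds chosen for this sketch.
    Returns (canonical, realized, learner_rate, native_rate) tuples,
    sorted by the learner-native gap, largest first.
    """
    found = []
    for canonical, row in learner.items():
        for realized, rate in row.items():
            if realized == canonical:
                continue  # a faithful realization is not a variation
            native_rate = native.get(canonical, {}).get(realized, 0.0)
            if rate >= min_rate and rate - native_rate >= min_gap:
                found.append((canonical, realized, rate, native_rate))
    return sorted(found, key=lambda item: item[2] - item[3], reverse=True)


# Toy, made-up alignments (not data from the study).
learner_pairs = [("z", "s"), ("z", "s"), ("z", "z"),
                 ("ð", "d"), ("ð", "ð"), ("ð", "ð"), ("ð", "ð")]
native_pairs = [("z", "z"), ("z", "z"), ("z", "s"), ("z", "z"),
                ("ð", "ð"), ("ð", "ð"), ("ð", "d"), ("ð", "ð")]

learner = variation_matrix(learner_pairs)
native = variation_matrix(native_pairs)
result = noticeable_variations(learner, native)
```

In this toy example only the substitution of /s/ for /z/ is flagged, because the learner rate exceeds the native rate by more than the assumed gap, while the substitution of /d/ for /ð/ occurs equally often in both matrices. Insertions and deletions could be handled in the same way by pairing a segment with an empty-slot label.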
dc.description.tableofcontents:
Abstract
Chapter 1 Introduction
1.1 Speech recognition and non-native speech
1.2 Purpose of the study
1.3 Scope of the study
1.4 Outline of the dissertation
Chapter 2 General Background
2.1 L2 speech production
2.2 English segments produced by Korean learners
2.2.1 Comparison of English and Korean phonemes
2.2.2 English vowels produced by Korean learners
2.2.3 English consonants produced by Korean learners
Chapter 3 Segment Variations Produced by Korean Learners
3.1 Method
3.1.1 English speech of Korean learners
3.1.1.1 Corpus
3.1.1.2 Transcriptions
3.1.2 English speech of native speakers
3.1.2.1 Corpus
3.1.2.2 Transcriptions
3.1.3 Canonical transcriptions
3.1.4 Generation of variation matrices
3.1.5 Analyses
3.2 Results
3.2.1 Variation rates
3.2.2 Vocalic variations in Korean learners' production
3.2.2.1 Noticeable vocalic variations in Korean learners' production
3.2.2.2 The most noticeable vocalic variations in Korean learners' production
3.2.3 Consonantal variations in Korean learners' production
3.2.3.1 Noticeable consonantal variations in Korean learners' production
3.2.3.2 The most noticeable consonantal variations in Korean learners' production
3.2.3.3 Noticeable consonantal variations in Korean learners' and native speakers' production
3.3 Discussion
3.3.1 Variation rates
3.3.2 Vocalic variations in Korean learners' production
3.3.3 Consonantal variations in Korean learners' production
3.3.4 Contributions
Chapter 4 Effects of linguistic and extra-linguistic factors on Korean learners' segment variations
4.1 Intra- and inter-learner variability
4.2 Method
4.2.1 Data
4.2.2 Factors of interest
4.2.2.1 Linguistic factors
4.2.2.2 Extra-linguistic factors
4.2.3 Generalized linear mixed models
4.3 Results
4.3.1 Substitution of /oʊ/ for /ɔ/
4.3.2 Substitution of /oʊ/ for /ɑ/
4.3.3 Substitution of /ɑ/ for /ʌ/
4.3.4 Substitution of /æ/ for /ʌ/
4.3.5 Substitution of /s/ for /z/
4.3.6 Substitution of /d/ for /ð/
4.3.7 Substitution of /b/ for /v/
4.3.8 Substitution of /p/ for /f/
4.4 Discussion
4.4.1 Effects of linguistic factors
4.4.2 Effects of extra-linguistic factors
4.4.3 Contributions
Chapter 5 Conclusion
References
Appendix
Abstract in Korean
dc.format: application/pdf
dc.format.extent: 2080107 bytes
dc.format.medium: application/pdf
dc.language.iso: en
dc.publisher: Seoul National University Graduate School
dc.subject: L2 production
dc.subject: Korean learners
dc.subject: English segments
dc.subject: corpus-based study
dc.subject.ddc: 401
dc.title: A Corpus-Based Study on Segment Variations in English Speech Produced by Korean Learners
dc.title.alternative: 한국인 학습자의 영어 분절음 변이에 관한 코퍼스 기반 연구
dc.type: Thesis
dc.description.degree: Doctor
dc.citation.pages: xii, 136
dc.contributor.affiliation: College of Humanities, Department of Linguistics
dc.date.awarded: 2015-02
Appears in Collections:
College of Humanities (인문대학) > Linguistics (언어학과) > Theses (Ph.D. / Sc.D._언어학과)

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.
