Modulation Spectrum-based Postfiltering of Synthesized Speech in the Wavelet Domain (Korean title: 파형요소 도메인에서의 변조 스펙트럼 기반 음성합성 후처리)

Authors

장세영

Advisor
김남수
Major
College of Engineering, Department of Electrical and Computer Engineering
Issue Date
2017-08
Publisher
Seoul National University Graduate School
Keywords
Postfiltering; Modulation spectrum (MS); Discrete wavelet transform (DWT); Dual-tree complex wavelet transform (DTCWT); Hidden Markov tree (HMT)
Description
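Thesis (Master's) -- Seoul National University Graduate School, College of Engineering, Department of Electrical and Computer Engineering, August 2017. Advisor: 김남수.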
Abstract
This thesis presents a wavelet-domain measure for postfiltering synthesized speech. The quality of hidden Markov model (HMM)-based parametric speech synthesis is degraded by the over-smoothing effect, in which the trajectories of the generated speech parameters are smoothed out and lack dynamics. The conventional method uses the modulation spectrum (MS) to quantify over-smoothing by measuring the spectral tilt in the MS. To improve performance, a modified version of the MS called the scaled modulation spectrum (SMS), which essentially separates the MS into different bands, is proposed and used in postfiltering. The performance of two wavelet transforms, the discrete wavelet transform (DWT) and the dual-tree complex wavelet transform (DTCWT), is evaluated. We also extend the SMS into a hidden Markov tree (HMT) model, which represents the interdependencies of the wavelet coefficients. Experimental results show that the proposed method performs better than the conventional method.
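As a rough illustration of the ideas named in the abstract, the sketch below computes a modulation spectrum (taken here as the log power spectrum of a parameter trajectory, a common definition) and a per-band log energy from a multi-level wavelet decomposition. It is not the thesis's SMS definition or its postfilter: the Haar wavelet, the toy trajectories, the moving-average "over-smoothing", and all function names are assumptions chosen to keep the example self-contained.

```python
import cmath
import math


def modulation_spectrum(trajectory):
    """Log power spectrum of a parameter trajectory (naive DFT, bins 0..n/2)."""
    n = len(trajectory)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(trajectory))
        spectrum.append(math.log(abs(acc) ** 2 + 1e-12))
    return spectrum


def haar_dwt(signal, levels):
    """Multi-level orthonormal Haar DWT (assumed wavelet, for illustration).

    Returns (approximation, details), where details[0] is the finest scale.
    """
    approx = list(signal)
    details = []
    for _ in range(levels):
        if len(approx) < 2 or len(approx) % 2:
            raise ValueError("length must be divisible by 2**levels")
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / math.sqrt(2) for a, b in pairs])
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
    return approx, details


def band_log_energies(signal, levels):
    """Log mean energy of the detail coefficients in each wavelet band."""
    _, details = haar_dwt(signal, levels)
    return [math.log(sum(c * c for c in band) / len(band) + 1e-12)
            for band in details]


def moving_average(signal, width):
    """Crude stand-in for over-smoothing (hypothetical, not the HMM effect)."""
    half = width // 2
    out = []
    for t in range(len(signal)):
        window = signal[max(0, t - half): t + half + 1]
        out.append(sum(window) / len(window))
    return out


# Toy "natural" trajectory: a slow contour plus a fast fluctuation.
natural = [math.sin(2 * math.pi * t / 32) + 0.5 * math.sin(2 * math.pi * 0.4 * t)
           for t in range(64)]
smoothed = moving_average(natural, 5)

natural_bands = band_log_energies(natural, 3)
smoothed_bands = band_log_energies(smoothed, 3)
natural_ms = modulation_spectrum(natural)
smoothed_ms = modulation_spectrum(smoothed)
```

In this toy setup the smoothed trajectory loses energy mainly in the finest wavelet band, which is the kind of band-wise gap a postfilter could try to compensate. A DTCWT or HMT-based variant, as studied in the thesis, would need a dedicated implementation and is not attempted here.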
Language
English
URI
https://hdl.handle.net/10371/137412
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.
