DocumentCode
730657
Title
Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis
Author
Takamichi, Shinnosuke ; Toda, Tomoki ; Black, Alan W. ; Nakamura, Satoshi
Author_Institution
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol. (NAIST), Nara, Japan
fYear
2015
fDate
19-24 April 2015
Firstpage
4210
Lastpage
4214
Abstract
This paper proposes a novel parameter generation algorithm for high-quality speech generation in Hidden Markov Model (HMM)-based speech synthesis. One of the biggest issues causing significant quality degradation is the over-smoothing effect often observed in generated parameter trajectories. Global Variance (GV) is known as a feature well correlated with the over-smoothing effect and a metric on the GV of the generated parameters is effectively used as a penalty term in the conventional parameter generation. However, the quality of the synthetic speech is far from that of the natural speech. Recently, we have found that a Modulation Spectrum (MS) of the generated parameters, which is also regarded as an extension of the GV, is more sensitively correlated with the over-smoothing effect than the GV. This paper incorporates a metric on the MS as a new penalty term in the proposed parameter generation algorithm. The experimental results demonstrate that the proposed parameter generation algorithm considering the MS yields significant improvements in synthetic speech quality compared to the conventional parameter generation algorithm considering the GV.
Keywords
hidden Markov models; modulation spectra; speech synthesis; HMM-based speech synthesis; generated parameter trajectory; global variance; hidden Markov model; high-quality speech generation; modulation spectrum; natural speech; over-smoothing effect; parameter generation algorithm; synthetic speech quality; Adaptation models; Hidden Markov models; Speech; Trajectory; HMM-based speech synthesis; global variance; modulation spectrum; over-smoothing; parameter generation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178764
Filename
7178764
Link To Document