DocumentCode :
2801269
Title :
Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis
Author :
Lei, Ming ; Ling, Zhen-Hua ; Dai, Li-Rong
Author_Institution :
iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4230
Lastpage :
4233
Abstract :
This paper presents a minimum generation error (MGE) training method using weighted Euclidean distance measure on line spectral pairs (LSP) for HMM-based speech synthesis. In this paper, weighted Euclidean distance on LSP is introduced as the measurement of generation error to improve the consistency between the model training criterion and the subjective perception on the distortion of synthetic speech. Several common weighting techniques are investigated and compared within the MGE training framework. The experimental results show that the formant bounded weighting (FBW) method achieves the best performance, which improves the naturalness of synthetic speech significantly compared with the Euclidean LSP distance measure. Compared with the MGE training using log spectral distortion (LSD) measure, the FBW criterion can achieve similar performance on naturalness with much less computation complexity of model training.
Keywords :
distortion; hidden Markov models; speech synthesis; HMM based speech synthesis; formant bounded weighting method; line spectral pairs; log spectral distortion measure; minimum generation error training; synthetic speech distortion; weighted Euclidean distance; Acceleration; Computational modeling; Distortion measurement; Euclidean distance; Hidden Markov models; Maximum likelihood estimation; Predictive models; Speech synthesis; Synthesizers; Weight measurement; Speech synthesis; hidden Markov model; line spectral pairs; minimum generation error;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495688
Filename :
5495688
Link To Document :
بازگشت