DocumentCode :
2175411
Title :
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis
Author :
Lei, Ming ; Ling, Zhen-Hua ; Dai, Li-Rong
Author_Institution :
iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4712
Lastpage :
4715
Abstract :
Ordering property is an important property of LSP and closely connected with the naturalness of reconstructed speech. When LSP is adopted as spectrum feature in HMM-based parametric speech synthesis, the ordering property cannot be guaranteed because diagonal covariance matrix is used in conventional system and the cross dimension correlation of LSP vector is ignored. It will cause un stable issue in synthesized speech. In this paper, we propose some methods to preserve the ordering property of generated LSPs for MGE training by introducing mis-ordering related distance measurements into model training criterion. Experimental results show that two methods can alleviate the mis-orderings significantly without degrading the MGE performance, and one of which, the minimum mis-ordering counting method, requires no acoustic observations for model optimization.
Keywords :
hidden Markov models; optimisation; speech synthesis; HMM-based speech synthesis; MGE training; covariance matrix; distance measurements; generated LSPS; generation error training; model optimization; Acoustics; Hidden Markov models; Neodymium; Optimization; Speech; Speech synthesis; Training; Hidden Markov model; line spectrum pair; minimum generation error; ordering property; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947407
Filename :
5947407
Link To Document :
بازگشت