DocumentCode
2175411
Title
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis
Author
Lei, Ming ; Ling, Zhen-Hua ; Dai, Li-Rong
Author_Institution
iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fYear
2011
fDate
22-27 May 2011
Firstpage
4712
Lastpage
4715
Abstract
Ordering property is an important property of LSP and closely connected with the naturalness of reconstructed speech. When LSP is adopted as spectrum feature in HMM-based parametric speech synthesis, the ordering property cannot be guaranteed because diagonal covariance matrix is used in conventional system and the cross dimension correlation of LSP vector is ignored. It will cause un stable issue in synthesized speech. In this paper, we propose some methods to preserve the ordering property of generated LSPs for MGE training by introducing mis-ordering related distance measurements into model training criterion. Experimental results show that two methods can alleviate the mis-orderings significantly without degrading the MGE performance, and one of which, the minimum mis-ordering counting method, requires no acoustic observations for model optimization.
Keywords
hidden Markov models; optimisation; speech synthesis; HMM-based speech synthesis; MGE training; covariance matrix; distance measurements; generated LSPS; generation error training; model optimization; Acoustics; Hidden Markov models; Neodymium; Optimization; Speech; Speech synthesis; Training; Hidden Markov model; line spectrum pair; minimum generation error; ordering property; speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947407
Filename
5947407
Link To Document