DocumentCode
3421821
Title
Minumum generation error linear regression based model adaptation for HMM-based speech synthesis
Author
Qin, Long ; Wu, Yi-Jian ; Ling, Zhen-Hua ; Wang, Ren-Hua ; Dai, Li-Rong
Author_Institution
Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, USA
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
3953
Lastpage
3956
Abstract
Due to the inconsistency between the maximum likelihood (ML) based training and the synthesis application in HMM-based speech synthesis, a minimum generation error (MGE) criterion had been proposed for HMM training. This paper continues to apply the MGE criterion to model adaptation for HMM-based speech synthesis. We propose a MGE linear regression (MGELR) based model adaptation algorithm, where the regression matrices used to transform source models to target models are optimized to minimize the generation errors for the input speech data uttered by the target speaker. The proposed MGELR approach was compared with the maximum likelihood linear regression (MLLR) based model adaptation. Experimental results indicate that the generation errors were reduced after the MGELR-based model adaptation. And from the subjective listening test, the discrimination and the quality of the synthesized speech using MGELR were better than the results using MLLR.
Keywords
Adaptation model; Context modeling; Hidden Markov models; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; Natural languages; Speech synthesis; Testing; Training data; Speech synthesis; linear regression; minimum generation error; model adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV, USA
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518519
Filename
4518519
Link To Document