Title :
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
Author :
Long Qin ; Yi-Jian Wu ; Zhen-Hua Ling ; Ren-Hua Wang ; Li-Rong Dai
Author_Institution :
Carnegie Mellon Univ., Carnegie Mellon Univ., Pittsburgh, PA
fDate :
March 31 2008-April 4 2008
Abstract :
Due to the inconsistency between the maximum likelihood (ML) based training and the synthesis application in HMM-based speech synthesis, a minimum generation error (MGE) criterion had been proposed for HMM training. This paper continues to apply the MGE criterion to model adaptation for HMM-based speech synthesis. We propose a MGE linear regression (MGELR) based model adaptation algorithm, where the regression matrices used to transform source models to target models are optimized to minimize the generation errors for the input speech data uttered by the target speaker. The proposed MGELR approach was compared with the maximum likelihood linear regression (MLLR) based model adaptation. Experimental results indicate that the generation errors were reduced after the MGELR-based model adaptation. And from the subjective listening test, the discrimination and the quality of the synthesized speech using MGELR were better than the results using MLLR.
Keywords :
error analysis; hidden Markov models; maximum likelihood estimation; regression analysis; speech synthesis; HMM-based speech synthesis; generation error minimization; maximum likelihood based training; maximum likelihood linear regression; minimum generation error criterion; minimum generation error linear regression; model adaptation; model adaptation algorithm; transform source models; Acoustic distortion; Character generation; Clustering algorithms; Computational efficiency; Distortion measurement; Hidden Markov models; Loudspeakers; Speech processing; Speech synthesis; Statistics; HMM; Speech synthesis; global variance; local variance; minimum generation error;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518686