مرکز منطقه ای اطلاع رساني علوم و فناوري - Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion

DocumentCode :

2290349

Title :

Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion

Author :

Qin, Long ; Wu, Yi-Jian ; Ling, Zhen-Hua ; Wang, Ren-Hua

Author_Institution :

Carnegie Mellon Univ., Pittsburgh, PA

fYear :

2008

fDate :

15-17 Dec. 2008

Firstpage :

539

Lastpage :

544

Abstract :

In order to solve the issues related to the maximum likelihood (ML) based HMM training for HMM-based speech synthesis, a minimum generation error (MGE) criterion had been proposed. This paper continues to apply the MGE criterion to model adaptation for HMM-based speech synthesis. We introduce a MGE linear regression (MGELR) based model adaptation algorithm, where the transforms from source HMMs to target HMMs are optimized to minimize the generation errors for the adaptation data of the target speaker. The regression matrices for both mean vector and covariance matrix of Gaussian distribution are re-estimated. The proposed MGELR approach was compared with the maximum likelihood linear regression (MLLR) based model adaptation. Experimental results indicate that the generation errors were reduced after the MGELR-based model adaptation. And from the subjective listening test, the speaker similarity and the quality of the synthesized speech using MGELR were better than the results using MLLR.

Keywords :

Gaussian distribution; covariance matrices; error correction; hidden Markov models; maximum likelihood estimation; regression analysis; speech synthesis; transforms; vectors; Gaussian distribution; HMM training; HMM-based speech synthesis; MGE linear regression based model adaptation algorithm; covariance matrix; maximum likelihood linear regression based model adaptation; mean vector; minimum generation error criterion; regression matrices; speaker similarity; subjective listening test; synthesized speech; target speaker; transforms; Adaptation model; Clustering algorithms; Covariance matrix; Hidden Markov models; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; Speech synthesis; Training data; USA Councils; HMM-based speech synthesis; minimum generation error; model adaptation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia, 2008. ISM 2008. Tenth IEEE International Symposium on

Conference_Location :

Berkeley, CA

Print_ISBN :

978-0-7695-3454-1

Electronic_ISBN :

978-0-7695-3454-1

Type :

conf

DOI :

10.1109/ISM.2008.36

Filename :

4741223

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2290349