DocumentCode :
2703906
Title :
A Speaker Adaptation Technique for MRHSMM-Based Style Control of Synthetic Speech
Author :
Nose, Takashi ; Kato, Yoichi ; Kobayashi, Takao
Author_Institution :
Interdisciplinary Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM), In the MRHSMM-based style control technique, when available training data is very small, the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.
Keywords :
hidden Markov models; speaker recognition; speech synthesis; MRHSMM; linear transformation framework; model adaptation technique; multiple regression hidden semiMarkov model; speaker adaptation technique; speech recognition; speech synthesis; style control technique; style synthetic speech; transformation matrices; unnatural sounding speech; Adaptation model; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Nose; Speech recognition; Speech synthesis; Testing; Training data; Expressive speech synthesis; Hidden Markov model; MLLR; Speaker adaptation; Style control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2007.367042
Filename :
4218230
Link To Document :
بازگشت