Title :
A Speaker Adaptation Technique for MRHSMM-Based Style Control of Synthetic Speech
Author :
Nose, Takashi ; Kato, Yoichi ; Kobayashi, Takao
Author_Institution :
Interdisciplinary Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
Abstract :
This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM), In the MRHSMM-based style control technique, when available training data is very small, the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.
Keywords :
hidden Markov models; speaker recognition; speech synthesis; MRHSMM; linear transformation framework; model adaptation technique; multiple regression hidden semiMarkov model; speaker adaptation technique; speech recognition; speech synthesis; style control technique; style synthetic speech; transformation matrices; unnatural sounding speech; Adaptation model; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Nose; Speech recognition; Speech synthesis; Testing; Training data; Expressive speech synthesis; Hidden Markov model; MLLR; Speaker adaptation; Style control;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2007.367042