DocumentCode :
394310
Title :
Speaker adaptation by hierarchical EigenVoice
Author :
Onishi, Yoshijiumi ; Iso, Ken-ichi
Author_Institution :
Multimedia Res. Labs., NEC Corp., Kawasaki, Japan
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
We propose a novel speaker adaptation method, hierarchical EigenVoice (HEV). This method extends the eigenvoice through clustering the Gaussian components of HMMs into a hierarchical tree structure. It enables one to autonomously control a number of adaptation parameters (model complexity) depending on the amount of adaptation utterances from a new speaker. The experimental results of Japanese large vocabulary continuous speech recognition confirmed the significant performance increase in all range of the adaptation utterance amounts compared with the conventional speaker adaptation methods.
Keywords :
Gaussian processes; eigenvalues and eigenfunctions; hidden Markov models; pattern clustering; speech recognition; Gaussian components clustering; Japan; adaptation parameters; adaptation utterance; continuous density mixture Gaussian HMM; hidden Markov models; hierarchical EigenVoice; hierarchical tree structure; large vocabulary continuous speech recognition; model complexity; speaker adaptation; speaker adaptation method; Adaptation model; Databases; Hidden Markov models; Hybrid electric vehicles; Laboratories; Loudspeakers; Maximum likelihood linear regression; Principal component analysis; Speech recognition; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198846
Filename :
1198846
Link To Document :
بازگشت