DocumentCode :
352334
Title :
Fast speaker adaptation of large vocabulary continuous density HMM speech recognizer using a basis transform approach
Author :
Boulis, Constantinos ; Digalakis, Vassilios
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece
Volume :
2
fYear :
2000
fDate :
2000
Abstract :
Maximum likelihood transformation-adaptation techniques have proven successful, but it is believed that faster convergence to speaker dependent (SD) performance can be achieved if we incorporate some form of a-priori knowledge in the adaptation process. In this paper, instead of estimating one linear transform per class of models for each new speaker, we transform the speaker-independent (SI) models using multiple linear transforms and a weight vector. To reduce the number of adaptation parameters, the multiple linear transforms are generated from training speakers and the adaptation parameters consist of a single weight vector per class. This can be seen as incorporating a-priori knowledge to our estimation process. Experiments conducted on the Spoken Language Translator database in the Swedish language using SRI´s DECIPHERTM system, show that the new method outperforms maximum likelihood linear regression on very limited adaptation data
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; transforms; DECIPHERTM system; Spoken Language Translator database; Swedish; a-priori knowledge; adaptation process; basis transform approach; convergence; fast speaker adaptation; large vocabulary continuous density HMM speech recognizer; linear transform; maximum likelihood transformation-adaptation techniques; multiple linear transforms; speaker dependent performance; speaker-independent models; weight vector; Equations; Hidden Markov models; Maximum likelihood estimation; Maximum likelihood linear regression; Prototypes; Speech processing; Speech recognition; Transforms; Vectors; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.859128
Filename :
859128
Link To Document :
بازگشت