DocumentCode :
337475
Title :
Fast speaker adaptation using a priori knowledge
Author :
Kuhn, R. ; Nguyen, R. ; Junqua, J.C. ; Boman, R. ; Niedzielski, N. ; Fincke, S. ; Field, K. ; Contolini, M.
Author_Institution :
Speech Technol. Lab., Panasonic Technol. Inc., Santa Barbara, CA, USA
Volume :
2
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
749
Abstract :
Previously, we presented a radically new class of fast adaptation techniques for speech recognition, based on prior knowledge of speaker variation. To obtain this prior knowledge, one applies a dimensionality reduction technique to T vectors of dimension D derived from T speaker-dependent (SD) models. This offline step yields T basis vectors, the eigenvoices. We constrain the model for new speaker S to be located in the space spanned by the first K eigenvoices. Speaker adaptation involves estimating K eigenvoice coefficients for the new speaker; typically, K is very small compared to original dimension D. Here, we review how to find the eigenvoices, give a maximum-likelihood estimator for the new speaker´s eigenvoice coefficients, and summarize mean adaptation experiments carried out on the Isolet database. We present new results which assess the impact on performance of changes in training of the SD models. Finally, we interpret the first few eigenvoices obtained
Keywords :
adaptive signal processing; maximum likelihood estimation; principal component analysis; speech recognition; Isolet database; a priori knowledge; basis vectors; dimensionality reduction technique; eigenvoice coefficients; fast speaker adaptation; maximum-likelihood estimator; mean adaptation experiments; offline step; performance; principal component analysis; speaker variation; speaker-dependent models; speech recognition; training; vector dimension; Databases; Hidden Markov models; Independent component analysis; Laboratories; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Surfaces; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.759776
Filename :
759776
Link To Document :
بازگشت