Fast speaker adaptation using a priori knowledge

Author

Kuhn, R. ; Nguyen, R. ; Junqua, J.C. ; Boman, R. ; Niedzielski, N. ; Fincke, S. ; Field, K. ; Contolini, M.

Author_Institution

Speech Technol. Lab., Panasonic Technol. Inc., Santa Barbara, CA, USA

Volume

2

fYear

1999

fDate

15-19 Mar 1999

Firstpage

749

Abstract

Previously, we presented a radically new class of fast adaptation techniques for speech recognition, based on prior knowledge of speaker variation. To obtain this prior knowledge, one applies a dimensionality reduction technique to T vectors of dimension D derived from T speaker-dependent (SD) models. This offline step yields T basis vectors, the eigenvoices. We constrain the model for new speaker S to be located in the space spanned by the first K eigenvoices. Speaker adaptation involves estimating K eigenvoice coefficients for the new speaker; typically, K is very small compared to original dimension D. Here, we review how to find the eigenvoices, give a maximum-likelihood estimator for the new speaker´s eigenvoice coefficients, and summarize mean adaptation experiments carried out on the Isolet database. We present new results which assess the impact on performance of changes in training of the SD models. Finally, we interpret the first few eigenvoices obtained

Keywords

adaptive signal processing; maximum likelihood estimation; principal component analysis; speech recognition; Isolet database; a priori knowledge; basis vectors; dimensionality reduction technique; eigenvoice coefficients; fast speaker adaptation; maximum-likelihood estimator; mean adaptation experiments; offline step; performance; principal component analysis; speaker variation; speaker-dependent models; speech recognition; training; vector dimension; Databases; Hidden Markov models; Independent component analysis; Laboratories; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Surfaces; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location

Phoenix, AZ

ISSN

1520-6149

Print_ISBN

0-7803-5041-3

Type

conf

DOI

10.1109/ICASSP.1999.759776

Filename

759776