DocumentCode
337479
Title
Correlation modeling of MLLR transform biases for rapid HMM adaptation to new speakers
Author
Bocchieri, Enrico ; Digalakis, Vassilis ; Corduneanu, Adrian ; Boulis, Costas
Author_Institution
AT&T Bell Labs., USA
Volume
2
fYear
1999
fDate
15-19 Mar 1999
Firstpage
773
Abstract
This paper concerns rapid adaptation of hidden Markov model (HMM) based speech recognizers to a new speaker, when only few speech samples (one minute or less) are available from the new speaker. A widely used family of adaptation algorithms defines adaptation as a linearly constrained reestimation of the HMM Gaussians. With few speech data, tight constraints must be introduced, by reducing the number of linear transforms and by specifying certain transform structures (e.g. block diagonal). We hypothesize that under these adaptation conditions, the residual errors of the adapted Gaussian parameters can be represented and corrected by dependency models, as estimated from a training corpus. Thus, after introducing a particular class of linear transforms, we develop correlation models of the transform parameters. In rapid adaptation experiments on the Switchboard corpus, the proposed algorithm performs better than the transform-constrained adaptation and the adaptation by correlation modeling of the HMM parameters, respectively
Keywords
Gaussian processes; correlation methods; hidden Markov models; maximum likelihood estimation; speech recognition; transforms; MLLR transform biases; adapted Gaussian parameters; correlation modeling; correlation models; dependency model; hidden Markov model; linear transforms; linearly constrained reestimation; new speakers; rapid HMM adaptation; residual errors; speech recognizers; transform parameters; transform structures; Adaptation model; Automatic speech recognition; Error correction; Gaussian processes; Hidden Markov models; Humans; Loudspeakers; Maximum likelihood linear regression; Predictive models; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.759784
Filename
759784
Link To Document