DocumentCode :
3163272
Title :
Latent variable speaker adaptation of Gaussian mixture weights and means
Author :
Zhang, Xueru ; Demuynck, Kris ; Van hamme, Hugo
Author_Institution :
Dept. of Electr. Eng., Katholieke Univ. Leuven, Leuven, Belgium
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4349
Lastpage :
4352
Abstract :
We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition systems, which adapts both the Gaussian means and the mixture weights. Gaussian means are expressed as a linear combination of eigenvoices estimated with principal component analysis. The non-negative Gaussian mixture weights are expressed as a linear combination of a set of latent vectors estimated with non-negative matrix factorization. Experiments on the Wall Street Journal database show that the combination of weight and mean adaptation consistently improves the performance compared to eigenvoice adaptation only. Improvements up to 5.8% relative word error rate reduction were observed with 40 eigenvoices and 40 latent weight vectors. Furthermore, combining weight and mean adaptation outperformed both weight and mean adaptation on itself, even if the latter uses more latent vectors.
Keywords :
Gaussian processes; matrix decomposition; speaker recognition; Gaussian means; eigenvoices; latent variable speaker adaptation algorithm; linear combination; nonnegative Gaussian mixture weights; nonnegative matrix factorization; principal com- ponent analysis; relative word error rate reduction; vocabulary speech recognition systems; wall street journal database; Acoustics; Adaptation models; Data models; Hidden Markov models; Silicon; Training; Vectors; eigenvoice and weight adaptation; fast speaker adaptation; latent variable method; non-negative matrix factorization; speaker adaptive training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288882
Filename :
6288882
Link To Document :
بازگشت