DocumentCode
417167
Title
Enrollment in low-resource speech recognition systems
Author
Deligne, Sabine ; Dharanipragada, Satya
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We consider the problem of enrollment for low-resource speech recognition systems designed for noisy environments. Noise robustness concerns, memory and computational constraints, along with the use of compact acoustic models for fast Gaussian computation, make adaptation especially challenging. We derive a maximum a posteriori (MAP) algorithm especially designed for the fast off-line adaptation of these compact acoustic models. It requires less computation and memory than standard feature-space maximum likelihood linear regression (FMLLR) which is another technique well suited for compact acoustic models. In our experiments of speaker enrollment for speech recognition in the car, we present a computationally efficient procedure to simulate noisy conditions with the adaptation data. In these experiments, MAP compares favorably with FMLLR in terms of recognition accuracy. Besides, combining FMLLR and MAP significantly outperforms each technique individually, thus providing an efficient alternative for systems with larger resources.
Keywords
acoustic noise; acoustic signal processing; maximum likelihood estimation; random noise; speech recognition; Gaussian computation; MAP algorithm; compact acoustic models; enrollment; feature-space MLLR; feature-space maximum likelihood linear regression; low-resource speech recognition systems; maximum a posteriori algorithm; Acoustic noise; Algorithm design and analysis; Computational modeling; Gaussian noise; Loudspeakers; Maximum likelihood linear regression; Memory management; Noise robustness; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325992
Filename
1325992
Link To Document