DocumentCode :
417167
Title :
Enrollment in low-resource speech recognition systems
Author :
Deligne, Sabine ; Dharanipragada, Satya
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We consider the problem of enrollment for low-resource speech recognition systems designed for noisy environments. Noise robustness concerns, memory and computational constraints, along with the use of compact acoustic models for fast Gaussian computation, make adaptation especially challenging. We derive a maximum a posteriori (MAP) algorithm especially designed for the fast off-line adaptation of these compact acoustic models. It requires less computation and memory than standard feature-space maximum likelihood linear regression (FMLLR) which is another technique well suited for compact acoustic models. In our experiments of speaker enrollment for speech recognition in the car, we present a computationally efficient procedure to simulate noisy conditions with the adaptation data. In these experiments, MAP compares favorably with FMLLR in terms of recognition accuracy. Besides, combining FMLLR and MAP significantly outperforms each technique individually, thus providing an efficient alternative for systems with larger resources.
Keywords :
acoustic noise; acoustic signal processing; maximum likelihood estimation; random noise; speech recognition; Gaussian computation; MAP algorithm; compact acoustic models; enrollment; feature-space MLLR; feature-space maximum likelihood linear regression; low-resource speech recognition systems; maximum a posteriori algorithm; Acoustic noise; Algorithm design and analysis; Computational modeling; Gaussian noise; Loudspeakers; Maximum likelihood linear regression; Memory management; Noise robustness; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325992
Filename :
1325992
Link To Document :
بازگشت