DocumentCode
394329
Title
Model-space compensation of microphone and noise for speaker-independent speech recognition
Author
Gong, Yifan
Author_Institution
Speech Technol. Lab., Texas Instrum. Inc., Dallas, TX, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
Ambient noise (additive distortion) and microphone changes (convolutive distortion) are two sources of distortion that may severely degrade speech recognition performance in real operating environments. Simultaneously modeling the two distortion sources has been a great challenge for robust speech recognition. A method, called JAC (Joint compensation of Additive and Convolutive distortions), is presented. It uses two log-spectral domain components in speech acoustic models to represent additive and convolutive distortions. The method adapts HMM mean vectors with a noise estimate and a channel estimate. The noise estimate is calculated from the pre-utterance pause and the channel estimate is calculated using an EM algorithm from speech utterances produced in the distortion environment. Evaluated on a noisy speech database recorded in-vehicle with a hands-free distant microphone in several driving conditions, the algorithm reduces recognition word error rate in typical operating conditions by an order of magnitude.
Keywords
acoustic distortion; acoustic noise; acoustic signal processing; channel estimation; compensation; error statistics; hidden Markov models; optimisation; speech recognition; EM algorithm; HMM mean vectors; additive distortion; ambient noise; channel estimation; convolutive distortion; log-spectral domain; microphone changes; model-space compensation; noise estimation; pre-utterance pause; recognition word error rate; speaker-independent speech recognition; speech acoustic models; Acoustic distortion; Acoustic noise; Additive noise; Degradation; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198867
Filename
1198867
Link To Document