DocumentCode :
394329
Title :
Model-space compensation of microphone and noise for speaker-independent speech recognition
Author :
Gong, Yifan
Author_Institution :
Speech Technol. Lab., Texas Instrum. Inc., Dallas, TX, USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Ambient noise (additive distortion) and microphone changes (convolutive distortion) are two sources of distortion that may severely degrade speech recognition performance in real operating environments. Simultaneously modeling the two distortion sources has been a great challenge for robust speech recognition. A method, called JAC (Joint compensation of Additive and Convolutive distortions), is presented. It uses two log-spectral domain components in speech acoustic models to represent additive and convolutive distortions. The method adapts HMM mean vectors with a noise estimate and a channel estimate. The noise estimate is calculated from the pre-utterance pause and the channel estimate is calculated using an EM algorithm from speech utterances produced in the distortion environment. Evaluated on a noisy speech database recorded in-vehicle with a hands-free distant microphone in several driving conditions, the algorithm reduces recognition word error rate in typical operating conditions by an order of magnitude.
Keywords :
acoustic distortion; acoustic noise; acoustic signal processing; channel estimation; compensation; error statistics; hidden Markov models; optimisation; speech recognition; EM algorithm; HMM mean vectors; additive distortion; ambient noise; channel estimation; convolutive distortion; log-spectral domain; microphone changes; model-space compensation; noise estimation; pre-utterance pause; recognition word error rate; speaker-independent speech recognition; speech acoustic models; Acoustic distortion; Acoustic noise; Additive noise; Degradation; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198867
Filename :
1198867
Link To Document :
بازگشت