Model-space compensation of microphone and noise for speaker-independent speech recognition

Author

Gong, Yifan

Author_Institution

Speech Technol. Lab., Texas Instrum. Inc., Dallas, TX, USA

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

Ambient noise (additive distortion) and microphone changes (convolutive distortion) are two sources of distortion that may severely degrade speech recognition performance in real operating environments. Simultaneously modeling the two distortion sources has been a great challenge for robust speech recognition. A method, called JAC (Joint compensation of Additive and Convolutive distortions), is presented. It uses two log-spectral domain components in speech acoustic models to represent additive and convolutive distortions. The method adapts HMM mean vectors with a noise estimate and a channel estimate. The noise estimate is calculated from the pre-utterance pause and the channel estimate is calculated using an EM algorithm from speech utterances produced in the distortion environment. Evaluated on a noisy speech database recorded in-vehicle with a hands-free distant microphone in several driving conditions, the algorithm reduces recognition word error rate in typical operating conditions by an order of magnitude.

Keywords

acoustic distortion; acoustic noise; acoustic signal processing; channel estimation; compensation; error statistics; hidden Markov models; optimisation; speech recognition; EM algorithm; HMM mean vectors; additive distortion; ambient noise; channel estimation; convolutive distortion; log-spectral domain; microphone changes; model-space compensation; noise estimation; pre-utterance pause; recognition word error rate; speaker-independent speech recognition; speech acoustic models; Acoustic distortion; Acoustic noise; Additive noise; Degradation; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198867

Filename

1198867