Title :
Recursive estimation of time-varying environments for robust speech recognition
Author :
Zhao, Yunxin ; Wang, Shaojun ; Yen, Kuan-Chieh
Author_Institution :
Dept. of CECS, Missouri Univ., Columbia, MO, USA
Abstract :
An EM-type of recursive estimation algorithm is formulated in the DFT domain for joint estimation of time-varying parameters of distortion channel and additive noise from online degraded speech. Speech features are estimated from the posterior estimates of short-time speech power spectra in an on-the-fly fashion. Experiments were performed on speaker-independent continuous speech recognition using features of perceptually based linear prediction cepstral coefficients, log energy, and temporal regression coefficients. Speech data were taken from the TIMIT database and were degraded by simulated time-varying channel and noise. Experimental results showed significant improvement in recognition word accuracy due to the proposed recursive estimation as compared with the results from direct recognition using a baseline system and from performing speech feature estimation using a batch EM algorithm
Keywords :
AWGN; FIR filters; discrete Fourier transforms; least mean squares methods; linear predictive coding; noise; recursive estimation; speech recognition; telecommunication channels; DFT domain; TIMIT database; additive noise; baseline system; batch EM algorithm; direct recognition; discrete Fourier transform; distortion channel; log energy; online degraded speech; perceptually based linear prediction cepstral coefficients; posterior estimates; recognition word accuracy; recursive estimation; robust speech recognition; short-time speech power spectra; speaker-independent continuous speech recognition; speech feature estimation; temporal regression coefficients; time-varying environments; Additive noise; Cepstral analysis; Degradation; Noise robustness; Parameter estimation; Recursive estimation; Spatial databases; Speech enhancement; Speech recognition; Time-varying channels;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940808