DocumentCode :
846138
Title :
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
Author :
Deng, Li ; Droppo, Jasha ; Acero, Alex
Author_Institution :
Microsoft Res., Redmond, WA, USA
Volume :
11
Issue :
6
fYear :
2003
Firstpage :
568
Lastpage :
580
Abstract :
We describe a novel algorithm for recursive estimation of nonstationary acoustic noise which corrupts clean speech, and a successful application of the algorithm in the speech feature enhancement framework of noise-normalized SPLICE for robust speech recognition. The noise estimation algorithm makes use of a nonlinear model of the acoustic environment in the cepstral domain. Central to the algorithm is the innovative iterative stochastic approximation technique that improves piecewise linear approximation to the nonlinearity involved and that subsequently increases the accuracy for noise estimation. We report comprehensive experiments on SPLICE-based, noise-robust speech recognition for the AURORA2 task using the results of iterative stochastic approximation. The effectiveness of the new technique is demonstrated in comparison with a more traditional, MMSE noise estimation algorithm under otherwise identical conditions. The word error rate reduction achieved by iterative stochastic approximation for recursive noise estimation in the framework of noise-normalized SPLICE is 27.9% for the multicondition training mode, and 67.4% for the clean-only training mode, respectively, compared with the results using the standard cepstra with no speech enhancement and using the baseline HMM supplied by AURORA2. These represent the best performance in the clean-training category of the September-2001 AURORA2 evaluation. The relative error rate reduction achieved by using the same noise estimate is increased to 48.40% and 76.86%, respectively, for the two training modes after using a better designed HMM system. The experimental results demonstrated the crucial importance of using the newly introduced iterations in improving the earlier stochastic approximation technique, and showed sensitivity of the noise estimation algorithm´s performance to the forgetting factor embedded in the algorithm.
Keywords :
acoustic noise; cepstral analysis; hidden Markov models; iterative methods; least mean squares methods; piecewise linear techniques; recursive estimation; speech enhancement; speech recognition; stochastic processes; AURORA2 task; HMM system; MMSE; cepstral domain; clean-training category; hidden Markov model; iterative stochastic approximation; minimum mean squared error; noise-normalized SPLICE; nonlinear model; nonstationary acoustic noise; piecewise linear approximation; recursive noise estimation; speech recognition; stereo-based piecewise linear compensation for environment; Acoustic noise; Approximation algorithms; Iterative algorithms; Noise robustness; Piecewise linear approximation; Recursive estimation; Speech enhancement; Speech recognition; Stochastic resonance; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.818076
Filename :
1255445
Link To Document :
بازگشت