Title of article :
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
Author/Authors :
Deng، Kung-Li نويسنده , , J.، Droppo, نويسنده , , Acero، A نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2003
Pages :
-567
From page :
568
To page :
0
Abstract :
We describe a novel algorithm for recursive estimation of nonstationary acoustic noise which corrupts clean speech, and a successful application of the algorithm in the speech feature enhancement framework of noisenormalized SPLICE for robust speech recognition. The noise estimation algorithm makes use of a nonlinear model of the acoustic environment in the cepstral domain. Central to the algorithm is the innovative iterative stochastic approximation technique that improves piecewise linear approximation to the nonlinearity involved and that subsequently increases the accuracy for noise estimation. We report comprehensive experiments on SPLICEbased, noise-robust speech recognition for the AURORA2 task using the results of iterative stochastic approximation. The effectiveness of the new technique is demonstrated in comparison with a more traditional, MMSE noise estimation algorithm under otherwise identical conditions. The word error rate reduction achieved by iterative stochastic approximation for recursive noise estimation in the framework of noise-normalized SPLICE is 27.9% for the multicondition training mode, and 67.4% for the clean-only training mode, respectively, compared with the results using the standard cepstra with no speech enhancement and using the baseline HMM supplied by AURORA2. These represent the best performance in the clean-training category of the September-2001 AURORA2 evaluation. The relative error rate reduction achieved by using the same noise estimate is increased to 48.40% and 76.86%, respectively, for the two training modes after using a better designed HMM system. The experimental results demonstrated the crucial importance of using the newly introduced iterations in improving the earlier stochastic approximation technique, and showed sensitivity of the noise estimation algorithmʹs performance to the forgetting factor embedded in the algorithm.
Keywords :
Laminated waveguide , low-temperature co-fired ceramic (LTCC) , rectangular waveguide (RWG) , waveguide transition , millimeter wave
Journal title :
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Serial Year :
2003
Journal title :
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Record number :
86932
Link To Document :
بازگشت