Title :
A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR
Author :
Han, Zhaobing ; Zhang, Shuwu ; Zhang, Huayun ; Xu, Bo
Author_Institution :
Inst. of Autom., Acad. Sinica, Beijing, China
Abstract :
A vector statistical piecewise polynomial (VPP) approximation algorithm is proposed for environment compensation in speech signals that are degraded by both additive and convolutive noise. By investigating the model of the telephone environment, we address a piecewise polynomial, namely two linear polynomials and a quadratic polynomial, to approximate the environment function precisely. The VPP is applied either to stationary noise, or to non-stationary noise. In the first case, batch EM is used in the log-spectral domain; in the second case, recursive EM with iterative stochastic approximation is developed in the cepstral domain. Both approaches are based on the minimum mean squared error (MMSE) sense. Experimental results are presented on the application of this approach in improving the performance of Mandarin large vocabulary continuous speech recognition (LVCSR) in background noise and different transmission channels (such as fixed telephone line and GSM). The method can reduce the average character error rate (CER) by about 18%.
Keywords :
acoustic noise; error statistics; iterative methods; least mean squares methods; natural languages; optimisation; piecewise polynomial techniques; speech enhancement; speech recognition; statistical analysis; stochastic processes; telephony; MMSE; Mandarin large vocabulary continuous speech recognition; additive noise; batch EM; cepstral domain; character error rate; convolutive noise; environment compensation; iterative stochastic approximation; linear polynomials; log-spectral domain; minimum mean squared error; nonstationary noise; quadratic polynomial; recursive EM; speech signals; stationary noise; telephone LVCSR; vector statistical piecewise polynomial approximation; Additive noise; Approximation algorithms; Cepstral analysis; Degradation; Polynomials; Speech enhancement; Stochastic resonance; Telephony; Vectors; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202308