DocumentCode :
1257612
Title :
Nonlinear compensation for stochastic matching
Author :
Surendran, Arun C. ; Lee, Chin-Hui ; Rahim, Mazin
Author_Institution :
Bell Labs., Lucent Technol., Murray Hill, NJ, USA
Volume :
7
Issue :
6
fYear :
1999
fDate :
11/1/1999 12:00:00 AM
Firstpage :
643
Lastpage :
655
Abstract :
The performance of an automatic speech recognizer degrades when there exists an acoustic mismatch between the training and the testing conditions in the data. Though it is certain that the mismatch is nonlinear, its exact form is unknown. Tackling the problem of nonlinear mismatches is a difficult task that has not been adequately addressed before. We develop an approach that uses nonlinear transformations in the stochastic matching framework to compensate for acoustic mismatches. The functional form of the nonlinear transformation is modeled by neural networks. We develop a new technique to train neural networks using the generalized EM algorithm. This technique eliminates the need for stereo databases, which are difficult to obtain in practical applications. The new technique is data-driven and hence can be used under a wide variety of conditions without a priori knowledge of the environment. Using this technique, we show that we can provide improvement under various types of acoustic mismatch; in some cases a 72% reduction in word error rate is achieved
Keywords :
acoustic signal processing; learning (artificial intelligence); neural nets; nonlinear functions; optimisation; speech recognition; stochastic processes; acoustic mismatch compensation; automatic speech recognizer performance; data-driven technique; generalized EM algorithm; neural network training; nonlinear compensation; nonlinear mismatch; nonlinear transformations; stereo databases; stochastic matching; testing conditions; training conditions; word error rate reduction; Acoustic distortion; Acoustic testing; Additive noise; Automatic speech recognition; Neural networks; Nonlinear distortion; Signal generators; Speech recognition; Stochastic processes; Training data;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.799689
Filename :
799689
Link To Document :
بازگشت