DocumentCode :
3013090
Title :
A linear predictive front-end processor for speech recognition in noisy environments
Author :
Ephraim, Yariv ; Wilpon, Jay G. ; Rabiner, Lawrence R.
Author_Institution :
AT&T Bell Laboratories, Murray Hill, NJ
Volume :
12
fYear :
1987
fDate :
31868
Firstpage :
1324
Lastpage :
1327
Abstract :
We investigate the performance of a recent algorithm for linear predictive (LP) modeling of speech signals, which have been degraded by uncorrelated additive noise, as a front-end processor in a speech recognition system. The system is speaker dependent, and recognizes isolated words, based on dynamic time warping principles. The LP model for the clean speech is estimated through appropriate composite modeling of the noisy speech. This is done by minimizing the Itakura-Saito distortion measure between the sample spectrum of the noisy speech and the power spectral density of the composite model. This approach results in a "filtering-modeling" scheme in which the filter for the noisy speech, and the LP model for the clean speech, are alternatively optimized. The proposed system was tested using the 26 word English alphabet, the ten English digits, and the three command words, "stop," "error," and "repeat," which were contaminated by additive white noise at 5-20 dB signal to noise ratios (SNR\´s). By replacing the standard LP analysis with the proposed algorithm, during training on the clean speech and testing on the noisy speech, we achieve an improvement in recognition accuracy equivalent to an increase in input SNR of approximately 10 dB.
Keywords :
Distortion measurement; Power system modeling; Prediction algorithms; Predictive models; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type :
conf
DOI :
10.1109/ICASSP.1987.1169458
Filename :
1169458
Link To Document :
بازگشت