DocumentCode :
900276
Title :
Quantile based histogram equalization for noise robust large vocabulary speech recognition
Author :
Hilger, Florian ; Ney, Hermann
Author_Institution :
Telenet GmbH Kommunikationsysteme, Munich, Germany
Volume :
14
Issue :
3
fYear :
2006
fDate :
5/1/2006 12:00:00 AM
Firstpage :
845
Lastpage :
854
Abstract :
The noise robustness of automatic speech recognition systems can be improved by reducing an eventual mismatch between the training and test data distributions during feature extraction. Based on the quantiles of these distributions the parameters of transformation functions can be reliably estimated with small amounts of data. This paper will give a detailed review of quantile equalization applied to the Mel scaled filter bank, including considerations about the application in online systems and improvements through a second transformation step that combines neighboring filter channels. The recognition tests have shown that previous experimental observations on small vocabulary recognition tasks can be confirmed on the larger vocabulary Aurora 4 noisy Wall Street Journal database. The word error rate could be reduced from 45.7% to 25.5% (clean training) and from 19.5% to 17.0% (multicondition training).
Keywords :
channel bank filters; equalisers; speech recognition; Mel scaled filter bank; automatic speech recognition; filter channels; noise robustness; quantile based histogram equalization; vocabulary speech recognition; Automatic speech recognition; Automatic testing; Feature extraction; Filter bank; Histograms; Noise reduction; Noise robustness; Speech recognition; System testing; Vocabulary; Feature extraction; histogram normalization; noise robust speech recognition; quantile equalization;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.857792
Filename :
1621198
Link To Document :
بازگشت