DocumentCode
900276
Title
Quantile based histogram equalization for noise robust large vocabulary speech recognition
Author
Hilger, Florian ; Ney, Hermann
Author_Institution
Telenet GmbH Kommunikationsysteme, Munich, Germany
Volume
14
Issue
3
fYear
2006
fDate
5/1/2006 12:00:00 AM
Firstpage
845
Lastpage
854
Abstract
The noise robustness of automatic speech recognition systems can be improved by reducing an eventual mismatch between the training and test data distributions during feature extraction. Based on the quantiles of these distributions the parameters of transformation functions can be reliably estimated with small amounts of data. This paper will give a detailed review of quantile equalization applied to the Mel scaled filter bank, including considerations about the application in online systems and improvements through a second transformation step that combines neighboring filter channels. The recognition tests have shown that previous experimental observations on small vocabulary recognition tasks can be confirmed on the larger vocabulary Aurora 4 noisy Wall Street Journal database. The word error rate could be reduced from 45.7% to 25.5% (clean training) and from 19.5% to 17.0% (multicondition training).
Keywords
channel bank filters; equalisers; speech recognition; Mel scaled filter bank; automatic speech recognition; filter channels; noise robustness; quantile based histogram equalization; vocabulary speech recognition; Automatic speech recognition; Automatic testing; Feature extraction; Filter bank; Histograms; Noise reduction; Noise robustness; Speech recognition; System testing; Vocabulary; Feature extraction; histogram normalization; noise robust speech recognition; quantile equalization;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.857792
Filename
1621198
Link To Document