DocumentCode
2301680
Title
Robust speech recognition using adaptive noise threshold estimation and wavelet shrinkage
Author
Van Pham, Tuan ; Kubin, Gernot ; Rank, Erhard
Author_Institution
Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz
fYear
2008
fDate
4-6 June 2008
Firstpage
206
Lastpage
211
Abstract
We propose an improved noise reduction method for robust speech recognition based on a perceptually statistical wavelet filtering algorithm. Perceptual noise thresholds are estimated from the universal thresholds for each critical wavelet subband. Fast changes in background noise are tracked adaptively by improving our statistical percentile filtering method. Smoothed wavelet shrinkage is applied to enhance noisy wavelet coefficients. The proposed denoising algorithm is evaluated in terms of recognition performance under adverse noisy conditions such as car and factory environments. Furthermore, it is compared to recent speech enhancement methods embedded in different state-of-the-art speech recognizers. Overall, the results indicate that recognition performance on the AURORA3 SPEECHDAT-Car corpus is nearly the same as that obtained with the HTK recognizer using the advanced front-end, while there is an improvement when testing with the Loquendo recognizer on the SNOW-Factory corpus.
Keywords
adaptive estimation; signal denoising; smoothing methods; speech enhancement; speech recognition; statistical analysis; wavelet transforms; denoising algorithm; noise reduction method; perceptual adaptive noise threshold estimation; perceptually statistical wavelet filtering algorithm; robust speech recognition; smoothed wavelet shrinkage; speech enhancement method; statistical percentile filtering method; Adaptive filters; Background noise; Filtering algorithms; Noise reduction; Noise robustness; Production facilities; Speech enhancement; Speech recognition; Wavelet coefficients; Working environment noise; critical subbands; noise reduction; percentile filter; speech recognition; wavelet shrinkage;
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications and Electronics, 2008. ICCE 2008. Second International Conference on
Conference_Location
Hoi An
Print_ISBN
978-1-4244-2425-2
Electronic_ISBN
978-1-4244-2426-9
Type
conf
DOI
10.1109/CCE.2008.4578959
Filename
4578959