• DocumentCode
    2301680
  • Title
    Robust speech recognition using adaptive noise threshold estimation and wavelet shrinkage
  • Author
    Van Pham, Tuan; Kubin, Gernot; Rank, Erhard
  • Author_Institution
    Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz
  • fYear
    2008
  • fDate
    4-6 June 2008
  • Firstpage
    206
  • Lastpage
    211
  • Abstract
    We propose an improved noise reduction method for robust speech recognition based on a perceptually statistical wavelet filtering algorithm. Perceptual noise thresholds are estimated from the universal thresholds for each critical wavelet subband. Fast changes in background noise are tracked adaptively by an improved version of our statistical percentile filtering method. Smoothed wavelet shrinkage is applied to enhance the noisy wavelet coefficients. The proposed denoising algorithm is evaluated in terms of recognition performance under adverse noise conditions such as car and factory environments, and is compared to recent speech enhancement methods embedded in different state-of-the-art speech recognizers. Overall, the results indicate that recognition performance on the AURORA3 SPEECHDAT-Car corpus is nearly the same as that of the HTK recognizer using the advanced front-end, while there is an improvement when testing with the Loquendo recognizer on the SNOW-Factory corpus.
  • Keywords
    adaptive estimation; signal denoising; smoothing methods; speech enhancement; speech recognition; statistical analysis; wavelet transforms; denoising algorithm; noise reduction method; perceptual adaptive noise threshold estimation; perceptually statistical wavelet filtering algorithm; robust speech recognition; smoothed wavelet shrinkage; speech enhancement method; statistical percentile filtering method; Adaptive filters; Background noise; Filtering algorithms; Noise reduction; Noise robustness; Production facilities; Speech enhancement; Speech recognition; Wavelet coefficients; Working environment noise; critical subbands; noise reduction; percentile filter; speech recognition; wavelet shrinkage;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications and Electronics, 2008. ICCE 2008. Second International Conference on
  • Conference_Location
    Hoi An
  • Print_ISBN
    978-1-4244-2425-2
  • Electronic_ISBN
    978-1-4244-2426-9
  • Type
    conf
  • DOI
    10.1109/CCE.2008.4578959
  • Filename
    4578959
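
The abstract mentions universal thresholds and wavelet shrinkage. As a rough illustration of those two textbook ingredients only, the sketch below applies a one-level Haar transform, estimates a universal threshold from the detail coefficients, and soft-thresholds them. It is not the paper's method: the perceptual critical-subband thresholds, the percentile-filter noise tracking, and the smoothed shrinkage function are not reproduced here. The Haar wavelet, the median-absolute-deviation noise estimate, the use of the subband length as N, and all function names are assumptions made for this example.

```python
import math
import statistics


def haar_dwt(signal):
    """One level of the orthonormal Haar transform (even-length input assumed)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleave the reconstructed sample pairs."""
    s = 1.0 / math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * s)
        out.append((a - d) * s)
    return out


def universal_threshold(detail):
    """sigma * sqrt(2 ln N), with sigma estimated as median(|d|) / 0.6745.

    Assumption: N is taken as the number of coefficients in this subband.
    """
    sigma = statistics.median([abs(d) for d in detail]) / 0.6745
    return sigma * math.sqrt(2.0 * math.log(len(detail)))


def soft_shrink(coeffs, threshold):
    """Plain soft thresholding: shrink magnitudes toward zero by `threshold`."""
    return [math.copysign(max(abs(c) - threshold, 0.0), c) for c in coeffs]


def denoise(signal):
    """Transform, shrink the detail coefficients, and reconstruct."""
    approx, detail = haar_dwt(signal)
    thr = universal_threshold(detail)
    return haar_idwt(approx, soft_shrink(detail, thr))
```

A call such as `denoise(samples)` on an even-length list of floats returns a list of the same length. A practical speech front-end would use a multi-level decomposition and frame-wise, time-varying thresholds rather than this single global pass.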