• DocumentCode
    2964047
  • Title

    SNR features for automatic speech recognition

  • Author

    Garner, Philip N.

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    182
  • Lastpage
    187
  • Abstract
    When combined with cepstral normalisation techniques, the features normally used in Automatic Speech Recognition are based on Signal to Noise Ratio (SNR). We show that calculating SNR from the outset, rather than relying on cepstral normalisation to produce it, gives features with a number of practical and mathematical advantages over power-spectral based ones. In a detailed analysis, we derive Maximum Likelihood and Maximum a-Posteriori estimates for SNR based features, and show that they can outperform more conventional ones, especially when subsequently combined with cepstral variance normalisation. We further show anecdotal evidence that SNR based features lend themselves well to noise estimates based on low-energy envelope tracking.
  • Keywords
    cepstral analysis; feature extraction; maximum likelihood estimation; signal denoising; speech recognition; SNR features; automatic speech recognition; cepstral variance normalisation techniques; low-energy envelope tracking; maximum a-posteriori estimation; maximum likelihood estimation; power-spectral normalisation; signal to noise ratio; Additive noise; Automatic speech recognition; Background noise; Cepstral analysis; Convolution; Noise robustness; Noise shaping; Signal processing; Signal to noise ratio; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372895
  • Filename
    5372895