• DocumentCode
    2065301
  • Title

    Deriving MFCC Parameters from the Dynamic Spectrum for Robust Speech Recognition

  • Author

    Zheng, Nengheng ; Li, Xia ; Cao, Houwei ; Lee, Tan ; Ching, P.C.

  • Author_Institution
    Coll. of Inf. Eng., Shenzhen Univ., Shenzhen, China
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    State-of-the-art automatic speech recognition systems typically adopt the feature set containing mel-frequency cepstral coefficients (MFCC) and their time derivatives. The noise vulnerability of MFCC significantly degrades the recognition performance of such systems in noisy conditions. This paper describes a noise-robust feature extraction method. A set of new MFCC features is derived from the dynamic spectrum instead of the static spectrum as in the conventional MFCC feature extraction. It is shown that the dynamic spectrum preserves the spectral envelope information and, at the same time, is more noise resistant than the static spectrum. Experiments on Aurora 2 database show the noise robustness of the proposed features and it is preferable to replace MFCC with the new features in the state-of-the-art feature set.
  • Keywords
    feature extraction; speech recognition; Aurora 2 database; dynamic spectrum; mel-frequency cepstral coefficients; noise vulnerability; noise-robust feature extraction method; spectral envelope information; state-of-the-art automatic speech recognition systems; static spectrum; time derivatives; Acoustic distortion; Acoustic noise; Additive noise; Cepstral analysis; Feature extraction; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.33
  • Filename
    4730287