• DocumentCode
    3406544
  • Title

    Incorporating frequency masking filtering in a standard MFCC feature extraction algorithm

  • Author

    Zhu, Weizhong ; O´Shaughnessy, Douglas

  • Author_Institution
    INRS-EMT, Quebec Univ., Montreal, Que., Canada
  • Volume
    1
  • fYear
    2004
  • fDate
    31 Aug.-4 Sept. 2004
  • Firstpage
    617
  • Abstract
    Frequency masking filtering is introduced in a standard mel frequency cepstral coefficients (MFCC) feature extraction algorithm. It mimics a human masking mechanism to get more robust features when the input speech is distorted by various noises. The AURORA 2.0 database together with HTK speech recognition toolkits are used to evaluate the impact of the frequency masking filtering algorithm at various thresholds. It is shown that with the proper frequency masking coefficients, it can have about 6.59%, 6.01% and 1.20% relative performance improvements over standard MFCC for test A and test B and test C respectively, in clean-condition training. It works well on all eight different noise conditions. It has also proved to be effective when it is combined with other popular noise robust techniques, such as cepstral mean normalization. The proposed frequency masking filtering algorithm is fairly simple and it only requires a very small extra computation load.
  • Keywords
    cepstral analysis; feature extraction; filtering theory; speech recognition; AURORA 2.0 database; MFCC feature extraction algorithm; cepstral mean normalization; frequency masking filtering; human masking mechanism; mel frequency cepstral coefficient; noise robust technique; speech recognition toolkit; Cepstral analysis; Feature extraction; Filtering algorithms; Humans; Mel frequency cepstral coefficient; Noise robustness; Spatial databases; Speech coding; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
  • Print_ISBN
    0-7803-8406-7
  • Type

    conf

  • DOI
    10.1109/ICOSP.2004.1452739
  • Filename
    1452739