• DocumentCode
    3268528
  • Title

    2D psychoacoustic filtering for robust speech recognition

  • Author

    Dai, Peng ; Soon, Ing Yann ; Yeo, Chai Kiat

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2009
  • fDate
    8-10 Dec. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    One of the weaknesses of speech recognition system is its lack of robustness to background noise as compared to human listeners under similarly conditions. This paper proposes a 2D psychoacoustic modeling algorithm which is integrated with a feature extraction front-end for hidden Markov model (HMM). The proposed algorithm incorporates the properties of human auditory system and applies it to the speech recognition system to enhance its robustness. It integrates forward masking, lateral inhibition and cepstral mean normalization into ordinary mel-frequency cepstral coefficients (MFCC) feature extraction algorithm. Experiments carried out on AURORA2 database show that the word recognition rate can be improved significantly at low computational cost.
  • Keywords
    acoustic filters; cepstral analysis; feature extraction; filtering theory; hidden Markov models; speech intelligibility; speech recognition; 2D psychoacoustic filtering; background noise; cepstral mean normalization; feature extraction; forward masking; hidden Markov model; human auditory system; lateral inhibition; mel-frequency cepstral coefficient; robust speech recognition; word recognition rate; Background noise; Cepstral analysis; Feature extraction; Filtering; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Noise robustness; Psychology; Speech recognition; 2D Mask; Automatic Speech Recognition; Simultaneous Masking; Temporal Masking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Communications and Signal Processing, 2009. ICICS 2009. 7th International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4244-4656-8
  • Electronic_ISBN
    978-1-4244-4657-5
  • Type

    conf

  • DOI
    10.1109/ICICS.2009.5397502
  • Filename
    5397502