• DocumentCode
    2703739
  • Title

    An Auditory Neural Feature Extraction Method for Robust Speech Recognition

  • Author

    Wei Guo ; Liqing Zhang ; Bin Xia

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    This paper proposes a neural mechanism motivated system to extract noise resistant features for robust speech recognition. We use nonnegative matrix factorization to construct two layers of auditory neurons which captures the essence of speech patterns. The responses of these neurons to speech are further processed to form an auditory neural cepstral coefficient (ANCC) representation for speech recognition. We test the robustness of ANCC feature on a 51-word corpus, with recognizers trained on clean speech in noisy conditions. Compared with MFCC, ANCC shows less performance degradation and achieves satisfactory recognition accuracies in both non-stationary noise and high noise level conditions.
  • Keywords
    feature extraction; speech processing; speech recognition; auditory neural cepstral coefficient; auditory neural feature extraction method; neural mechanism; nonstationary noise; robust speech recognition; speech patterns; Cepstral analysis; Degradation; Feature extraction; Mel frequency cepstral coefficient; Neurons; Noise level; Noise robustness; Speech processing; Speech recognition; Testing; auditory system; feature extraction; robustness; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367032
  • Filename
    4218220