Title :
An Auditory Neural Feature Extraction Method for Robust Speech Recognition
Author :
Wei Guo ; Liqing Zhang ; Bin Xia
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
Abstract :
This paper proposes a neural mechanism motivated system to extract noise resistant features for robust speech recognition. We use nonnegative matrix factorization to construct two layers of auditory neurons which captures the essence of speech patterns. The responses of these neurons to speech are further processed to form an auditory neural cepstral coefficient (ANCC) representation for speech recognition. We test the robustness of ANCC feature on a 51-word corpus, with recognizers trained on clean speech in noisy conditions. Compared with MFCC, ANCC shows less performance degradation and achieves satisfactory recognition accuracies in both non-stationary noise and high noise level conditions.
Keywords :
feature extraction; speech processing; speech recognition; auditory neural cepstral coefficient; auditory neural feature extraction method; neural mechanism; nonstationary noise; robust speech recognition; speech patterns; Cepstral analysis; Degradation; Feature extraction; Mel frequency cepstral coefficient; Neurons; Noise level; Noise robustness; Speech processing; Speech recognition; Testing; auditory system; feature extraction; robustness; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367032