DocumentCode :
2703739
Title :
An Auditory Neural Feature Extraction Method for Robust Speech Recognition
Author :
Wei Guo ; Liqing Zhang ; Bin Xia
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper proposes a neural mechanism motivated system to extract noise resistant features for robust speech recognition. We use nonnegative matrix factorization to construct two layers of auditory neurons which captures the essence of speech patterns. The responses of these neurons to speech are further processed to form an auditory neural cepstral coefficient (ANCC) representation for speech recognition. We test the robustness of ANCC feature on a 51-word corpus, with recognizers trained on clean speech in noisy conditions. Compared with MFCC, ANCC shows less performance degradation and achieves satisfactory recognition accuracies in both non-stationary noise and high noise level conditions.
Keywords :
feature extraction; speech processing; speech recognition; auditory neural cepstral coefficient; auditory neural feature extraction method; neural mechanism; nonstationary noise; robust speech recognition; speech patterns; Cepstral analysis; Degradation; Feature extraction; Mel frequency cepstral coefficient; Neurons; Noise level; Noise robustness; Speech processing; Speech recognition; Testing; auditory system; feature extraction; robustness; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367032
Filename :
4218220
Link To Document :
بازگشت