DocumentCode
2703739
Title
An Auditory Neural Feature Extraction Method for Robust Speech Recognition
Author
Wei Guo ; Liqing Zhang ; Bin Xia
Author_Institution
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper proposes a neural mechanism motivated system to extract noise resistant features for robust speech recognition. We use nonnegative matrix factorization to construct two layers of auditory neurons which captures the essence of speech patterns. The responses of these neurons to speech are further processed to form an auditory neural cepstral coefficient (ANCC) representation for speech recognition. We test the robustness of ANCC feature on a 51-word corpus, with recognizers trained on clean speech in noisy conditions. Compared with MFCC, ANCC shows less performance degradation and achieves satisfactory recognition accuracies in both non-stationary noise and high noise level conditions.
Keywords
feature extraction; speech processing; speech recognition; auditory neural cepstral coefficient; auditory neural feature extraction method; neural mechanism; nonstationary noise; robust speech recognition; speech patterns; Cepstral analysis; Degradation; Feature extraction; Mel frequency cepstral coefficient; Neurons; Noise level; Noise robustness; Speech processing; Speech recognition; Testing; auditory system; feature extraction; robustness; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367032
Filename
4218220
Link To Document