DocumentCode :
2065301
Title :
Deriving MFCC Parameters from the Dynamic Spectrum for Robust Speech Recognition
Author :
Zheng, Nengheng ; Li, Xia ; Cao, Houwei ; Lee, Tan ; Ching, P.C.
Author_Institution :
Coll. of Inf. Eng., Shenzhen Univ., Shenzhen, China
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
State-of-the-art automatic speech recognition systems typically adopt the feature set containing mel-frequency cepstral coefficients (MFCC) and their time derivatives. The noise vulnerability of MFCC significantly degrades the recognition performance of such systems in noisy conditions. This paper describes a noise-robust feature extraction method. A set of new MFCC features is derived from the dynamic spectrum instead of the static spectrum as in the conventional MFCC feature extraction. It is shown that the dynamic spectrum preserves the spectral envelope information and, at the same time, is more noise resistant than the static spectrum. Experiments on Aurora 2 database show the noise robustness of the proposed features and it is preferable to replace MFCC with the new features in the state-of-the-art feature set.
Keywords :
feature extraction; speech recognition; Aurora 2 database; dynamic spectrum; mel-frequency cepstral coefficients; noise vulnerability; noise-robust feature extraction method; spectral envelope information; state-of-the-art automatic speech recognition systems; static spectrum; time derivatives; Acoustic distortion; Acoustic noise; Additive noise; Cepstral analysis; Feature extraction; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
Type :
conf
DOI :
10.1109/CHINSL.2008.ECP.33
Filename :
4730287
Link To Document :
بازگشت