DocumentCode
2065301
Title
Deriving MFCC Parameters from the Dynamic Spectrum for Robust Speech Recognition
Author
Zheng, Nengheng ; Li, Xia ; Cao, Houwei ; Lee, Tan ; Ching, P.C.
Author_Institution
Coll. of Inf. Eng., Shenzhen Univ., Shenzhen, China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
State-of-the-art automatic speech recognition systems typically adopt the feature set containing mel-frequency cepstral coefficients (MFCC) and their time derivatives. The noise vulnerability of MFCC significantly degrades the recognition performance of such systems in noisy conditions. This paper describes a noise-robust feature extraction method. A set of new MFCC features is derived from the dynamic spectrum instead of the static spectrum as in the conventional MFCC feature extraction. It is shown that the dynamic spectrum preserves the spectral envelope information and, at the same time, is more noise resistant than the static spectrum. Experiments on Aurora 2 database show the noise robustness of the proposed features and it is preferable to replace MFCC with the new features in the state-of-the-art feature set.
Keywords
feature extraction; speech recognition; Aurora 2 database; dynamic spectrum; mel-frequency cepstral coefficients; noise vulnerability; noise-robust feature extraction method; spectral envelope information; state-of-the-art automatic speech recognition systems; static spectrum; time derivatives; Acoustic distortion; Acoustic noise; Additive noise; Cepstral analysis; Feature extraction; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.33
Filename
4730287
Link To Document