Title :
MFCC based robust features for English word Recognition
Author :
Lokhande, N.N. ; Nehe, N.S. ; Vikhe, P.S.
Author_Institution :
Pravara Rural Eng. Coll. (PREC), Ahmednagar, India
Abstract :
In this paper we propose robust features of MFCC for speech recognition in noisy environments. The state-of art Mel-frequency Cepstral Coefficients technique were extensively used in speech Recognition. The MFCC are derived from speech frames by using Fast Fourier Transform and Discrete Cosine Transform. Performance of MFCC features is degrading in noisy environments. These features further normalized to get new set of features known as Cepstral Mean Normalized (CMN) features. Normalized features improves recognition rate in presence of additive noise. Performance of these noise robust features evaluated on own created English digit database using Vector Quantization technique. The effectiveness of the new feature set is demonstrated by the results of both Speaker-Independent and speaker-dependent recognition tasks in presence of white Gaussian noise.
Keywords :
AWGN; discrete cosine transforms; fast Fourier transforms; speaker recognition; speech coding; vector quantisation; CMN features; English digit database; English word recognition; MFCC based robust features; Mel-frequency cepstral coefficients technique; additive noise; cepstral mean normalized feature; discrete cosine transform; fast Fourier transform; noise robust features; speaker-dependent recognition task; speaker-independent recognition tasks; speech frames; speech recognition; vector quantization technique; white Gaussian noise; Feature extraction; Mel frequency cepstral coefficient; Robustness; Speech; Speech recognition; Vectors; Cepstral Mean Normalization; Feature Extraction; Mel Frequency Cepstral Coefficient; Vector Quantization;
Conference_Titel :
India Conference (INDICON), 2012 Annual IEEE
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-2270-6
DOI :
10.1109/INDCON.2012.6420726