Title :
Improving the filter bank of a classic speech feature extraction algorithm
Author :
Skowronski, Mark D. ; Harris, John G.
Author_Institution :
Computational Neuro-Eng. Lab, Florida Univ., Gainesville, FL, USA
Abstract :
The most popular speech feature extractor used in automatic speech recognition (ASR) systems today is the mel frequency cepstral coefficient (MFCC) algorithm. Introduced in 1980, the filter bank-based algorithm eventually replaced linear prediction cepstral coefficients (LPCC) as the premier front end, primarily because of MFCC´s superior robustness to additive noise. However, MFCC does not approximate the critical bandwidth of the human auditory system. We propose a novel scheme for decoupling filter bandwidth from other filter bank parameters, and we demonstrate improved noise robustness over three versions of MFCC through HMM-based experiments with the English digits in various noise environments.
Keywords :
cepstral analysis; feature extraction; filtering theory; hidden Markov models; noise; speech recognition; HMM-based experiments; MFCC algorithm; automatic speech recognition systems; filter bandwidth decoupling; filter bank-based algorithm; human auditory system; mel frequency cepstral coefficient algorithm; noise environments; noise robustness; speech feature extraction algorithm; Additive noise; Automatic speech recognition; Bandwidth; Cepstral analysis; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Noise robustness; Nonlinear filters; Working environment noise;
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
DOI :
10.1109/ISCAS.2003.1205828