DocumentCode
3536379
Title
Improving the filter bank of a classic speech feature extraction algorithm
Author
Skowronski, Mark D. ; Harris, John G.
Author_Institution
Computational Neuro-Eng. Lab, Florida Univ., Gainesville, FL, USA
Volume
4
fYear
2003
fDate
25-28 May 2003
Abstract
The most popular speech feature extractor used in automatic speech recognition (ASR) systems today is the mel frequency cepstral coefficient (MFCC) algorithm. Introduced in 1980, the filter bank-based algorithm eventually replaced linear prediction cepstral coefficients (LPCC) as the premier front end, primarily because of MFCC´s superior robustness to additive noise. However, MFCC does not approximate the critical bandwidth of the human auditory system. We propose a novel scheme for decoupling filter bandwidth from other filter bank parameters, and we demonstrate improved noise robustness over three versions of MFCC through HMM-based experiments with the English digits in various noise environments.
Keywords
cepstral analysis; feature extraction; filtering theory; hidden Markov models; noise; speech recognition; HMM-based experiments; MFCC algorithm; automatic speech recognition systems; filter bandwidth decoupling; filter bank-based algorithm; human auditory system; mel frequency cepstral coefficient algorithm; noise environments; noise robustness; speech feature extraction algorithm; Additive noise; Automatic speech recognition; Bandwidth; Cepstral analysis; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Noise robustness; Nonlinear filters; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN
0-7803-7761-3
Type
conf
DOI
10.1109/ISCAS.2003.1205828
Filename
1205828
Link To Document