DocumentCode :
2279272
Title :
Improved MFCC feature extraction by PCA-optimized filter-bank for speech recognition
Author :
Lee, Shang-Ming ; Fang, Shi-Hau ; Hung, Jeih-weih ; Lee, Lin-shan
Author_Institution :
Graduate Inst. of Comm. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2001
fDate :
2001
Firstpage :
49
Lastpage :
52
Abstract :
Although Mel-frequency cepstral coefficients (MFCC) have been proven to perform very well under most conditions, some limited efforts have been made in optimizing the shape of the filters in the filter-bank in the conventional MFCC approach. This paper presents a new feature extraction approach that designs the shapes of the filters in the filter-bank. In this new approach, the filter-bank coefficients are data-driven and obtained by applying principal component analysis (PCA) to the FFT spectrum of the training data. The experimental results show that this method is robust under noisy environment and is well additive with other noise-handling techniques.
Keywords :
channel bank filters; fast Fourier transforms; feature extraction; learning (artificial intelligence); optimisation; principal component analysis; speech recognition; FFT spectrum; MFCC; Mel-frequency cepstral coefficients; PCA; feature extraction; filter-bank; principal component analysis; speech recognition; Additive noise; Cepstral analysis; Feature extraction; Filters; Mel frequency cepstral coefficient; Noise shaping; Principal component analysis; Shape; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
Type :
conf
DOI :
10.1109/ASRU.2001.1034586
Filename :
1034586
Link To Document :
بازگشت