DocumentCode
3406544
Title
Incorporating frequency masking filtering in a standard MFCC feature extraction algorithm
Author
Zhu, Weizhong ; O´Shaughnessy, Douglas
Author_Institution
INRS-EMT, Quebec Univ., Montreal, Que., Canada
Volume
1
fYear
2004
fDate
31 Aug.-4 Sept. 2004
Firstpage
617
Abstract
Frequency masking filtering is introduced in a standard mel frequency cepstral coefficients (MFCC) feature extraction algorithm. It mimics a human masking mechanism to get more robust features when the input speech is distorted by various noises. The AURORA 2.0 database together with HTK speech recognition toolkits are used to evaluate the impact of the frequency masking filtering algorithm at various thresholds. It is shown that with the proper frequency masking coefficients, it can have about 6.59%, 6.01% and 1.20% relative performance improvements over standard MFCC for test A and test B and test C respectively, in clean-condition training. It works well on all eight different noise conditions. It has also proved to be effective when it is combined with other popular noise robust techniques, such as cepstral mean normalization. The proposed frequency masking filtering algorithm is fairly simple and it only requires a very small extra computation load.
Keywords
cepstral analysis; feature extraction; filtering theory; speech recognition; AURORA 2.0 database; MFCC feature extraction algorithm; cepstral mean normalization; frequency masking filtering; human masking mechanism; mel frequency cepstral coefficient; noise robust technique; speech recognition toolkit; Cepstral analysis; Feature extraction; Filtering algorithms; Humans; Mel frequency cepstral coefficient; Noise robustness; Spatial databases; Speech coding; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN
0-7803-8406-7
Type
conf
DOI
10.1109/ICOSP.2004.1452739
Filename
1452739
Link To Document