DocumentCode :
3406544
Title :
Incorporating frequency masking filtering in a standard MFCC feature extraction algorithm
Author :
Zhu, Weizhong ; O´Shaughnessy, Douglas
Author_Institution :
INRS-EMT, Quebec Univ., Montreal, Que., Canada
Volume :
1
fYear :
2004
fDate :
31 Aug.-4 Sept. 2004
Firstpage :
617
Abstract :
Frequency masking filtering is introduced in a standard mel frequency cepstral coefficients (MFCC) feature extraction algorithm. It mimics a human masking mechanism to get more robust features when the input speech is distorted by various noises. The AURORA 2.0 database together with HTK speech recognition toolkits are used to evaluate the impact of the frequency masking filtering algorithm at various thresholds. It is shown that with the proper frequency masking coefficients, it can have about 6.59%, 6.01% and 1.20% relative performance improvements over standard MFCC for test A and test B and test C respectively, in clean-condition training. It works well on all eight different noise conditions. It has also proved to be effective when it is combined with other popular noise robust techniques, such as cepstral mean normalization. The proposed frequency masking filtering algorithm is fairly simple and it only requires a very small extra computation load.
Keywords :
cepstral analysis; feature extraction; filtering theory; speech recognition; AURORA 2.0 database; MFCC feature extraction algorithm; cepstral mean normalization; frequency masking filtering; human masking mechanism; mel frequency cepstral coefficient; noise robust technique; speech recognition toolkit; Cepstral analysis; Feature extraction; Filtering algorithms; Humans; Mel frequency cepstral coefficient; Noise robustness; Spatial databases; Speech coding; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
Type :
conf
DOI :
10.1109/ICOSP.2004.1452739
Filename :
1452739
Link To Document :
بازگشت