Title :
Incorporating frequency masking filtering in a standard MFCC feature extraction algorithm
Author :
Zhu, Weizhong ; O´Shaughnessy, Douglas
Author_Institution :
INRS-EMT, Quebec Univ., Montreal, Que., Canada
fDate :
31 Aug.-4 Sept. 2004
Abstract :
Frequency masking filtering is introduced in a standard mel frequency cepstral coefficients (MFCC) feature extraction algorithm. It mimics a human masking mechanism to get more robust features when the input speech is distorted by various noises. The AURORA 2.0 database together with HTK speech recognition toolkits are used to evaluate the impact of the frequency masking filtering algorithm at various thresholds. It is shown that with the proper frequency masking coefficients, it can have about 6.59%, 6.01% and 1.20% relative performance improvements over standard MFCC for test A and test B and test C respectively, in clean-condition training. It works well on all eight different noise conditions. It has also proved to be effective when it is combined with other popular noise robust techniques, such as cepstral mean normalization. The proposed frequency masking filtering algorithm is fairly simple and it only requires a very small extra computation load.
Keywords :
cepstral analysis; feature extraction; filtering theory; speech recognition; AURORA 2.0 database; MFCC feature extraction algorithm; cepstral mean normalization; frequency masking filtering; human masking mechanism; mel frequency cepstral coefficient; noise robust technique; speech recognition toolkit; Cepstral analysis; Feature extraction; Filtering algorithms; Humans; Mel frequency cepstral coefficient; Noise robustness; Spatial databases; Speech coding; Speech recognition; Testing;
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
DOI :
10.1109/ICOSP.2004.1452739