Title :
Auditory model based modified MFCC features
Author :
Chatterjee, Saikat ; Kleijn, W. Bastiaan
Author_Institution :
ACCESS Linnaeus Center, KTH-R. Inst. of Technol., Stockholm, Sweden
Abstract :
Using spectral and spectro-temporal auditory models, we develop a computationally simple feature vector based on the design architecture of existing mel frequency cepstral coefficients (MFCCs). Along with the use of an optimized static function to compress a set of filter bank energies, we propose to use a memory-based adaptive compression function to incorporate the behavior of human auditory response across time and frequency. We show that a significant improvement in automatic speech recognition (ASR) performance is obtained for any environmental condition, clean as well as noisy.
Keywords :
data compression; hearing; physiological models; speech processing; speech recognition; ASR performance; automatic speech recognition; feature vector; filter bank energy; human auditory response; mel frequency cepstral coefficients; memory based adaptive compression function; modified MFCC features; optimized static function; spectral auditory model; spectro-temporal auditory model; Acceleration; Auditory system; Automatic speech recognition; Computational complexity; Filter bank; Humans; Mel frequency cepstral coefficient; Psychoacoustic models; Signal processing; Time factors; ASR; MFCC; auditory model;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495557