DocumentCode :
3195110
Title :
An Improved Implementation for an Auditory-Inspired FFT Model with Application in Audio Classification
Author :
Chu, Wei ; Champagne, Benoît
Author_Institution :
McGill Univ., Montreal
fYear :
2007
fDate :
2-5 July 2007
Firstpage :
196
Lastpage :
199
Abstract :
In this paper, we present an improved implementation for an auditory-inspired FFT-based model which calculates a noise-robust FFT spectrum. Through the use of characteristic frequency (CF) values of the cochlear filters in an early auditory (EA) model for power spectrum selection, and the use of a pair of running averages for the implementation of self-normalization, the proposed FFT model allows more flexibility in the extraction of audio features. To evaluate the performance of the proposed FFT model, a speech/music/noise classification task is carried out wherein a decision tree learning algorithm (C4.5) is used as the classifier. Audio features used for classification include the mel-frequency cepstral coefficient (MFCC) features, a set of conventional spectral features, and spectral features calculated using the proposed FFT model. Compared to the conventional MFCC and spectral features, the spectral features based on the proposed FFT model show more robust performance in noisy test cases.
Keywords :
audio signal processing; fast Fourier transforms; feature extraction; signal classification; audio classification; auditory-inspired FFT model; characteristic frequency values; cochlear filters; decision tree learning algorithm; early auditory model; mel-frequency cepstral coefficient; power spectrum selection; Cepstral analysis; Classification tree analysis; Decision trees; Feature extraction; Filters; Mel frequency cepstral coefficient; Noise robustness; Speech analysis; Speech enhancement; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
Type :
conf
DOI :
10.1109/ICME.2007.4284620
Filename :
4284620
Link To Document :
بازگشت