Title :
A Simplified Early Auditory Model with Application in Speech/Music Classification
Author :
Chu, Wei ; Champagne, Benoit
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, Que.
Abstract :
The past decade has seen extensive research on audio classification and segmentation algorithms. However, the effect of background noise on the performance of classification has not been investigated widely. Recently, an early auditory model that calculates a so-called auditory spectrum has been employed in audio classification where excellent performance is reported along with robustness in noisy environment. Unfortunately, this early auditory model is characterized by high computational requirements and the use of nonlinear processing. In this paper, by introducing certain modifications we propose a simplified version of this model which is linear except for the calculation of the square-root value of the energy. A speech/music classification task is carried out to evaluate the classification performance wherein a support vector machine (SVM) is used as the classifier. Compared to a conventional FFT-based spectrum, both the original auditory spectrum and the proposed simplified auditory spectrum show more robust performance in noisy test cases. Test results also indicate that, with a reduced computational complexity, the performance of the proposed simplified auditory spectrum is close to that of the original auditory spectrum
Keywords :
audio signal processing; music; signal classification; speech processing; support vector machines; FFT-based spectrum; audio classification; audio segmentation; auditory spectrum; music classification; noisy test case; nonlinear processing; simplified auditory model; speech classification; square-root value; support vector machine; Acoustic noise; Background noise; Classification algorithms; Computational complexity; Noise robustness; Speech analysis; Support vector machine classification; Support vector machines; Testing; Working environment noise; Audio classification; auditory spectrum; early auditory model; noise-robustness;
Conference_Titel :
Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
Conference_Location :
Ottawa, Ont.
Print_ISBN :
1-4244-0038-4
Electronic_ISBN :
1-4244-0038-4
DOI :
10.1109/CCECE.2006.277665