Title :
SVM-based audio scene classification
Author :
Jiang, Hongchen ; Bai, Junmei ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
Audio scene classification is very important in audio indexing, retrieval and video content analysis. In this paper we present our approach that uses support vector machine (SVM) for audio scene classification, which classifies audio clips into one of five classes: pure speech, non-pure speech, music, environment sound, and silence. Among them, non-pure speech may further be divided into speech with music and speech with noise. We also describe two methods to select effective and robust audio feature sets. Based on these feature sets, we have evaluated and compared the performance of two kinds of classification frameworks on a testing database that is composed of about 4-hour audio data. The experimental results have shown that the SVM-based method yields high accuracy with high processing speed.
Keywords :
audio signal processing; pattern classification; signal classification; speech processing; support vector machines; SVM-based audio scene classification; audio clip classification; audio feature sets; environment sound; music; nonpure speech; pure speech; support vector machine; Acoustic noise; Content based retrieval; Indexing; Layout; Music information retrieval; Noise robustness; Speech enhancement; Support vector machine classification; Support vector machines; Working environment noise;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598721