DocumentCode
3317029
Title
SVM-based audio scene classification
Author
Jiang, Hongchen ; Bai, Junmei ; Zhang, Shuwu ; Xu, Bo
Author_Institution
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
131
Lastpage
136
Abstract
Audio scene classification is very important in audio indexing, retrieval and video content analysis. In this paper we present our approach that uses support vector machine (SVM) for audio scene classification, which classifies audio clips into one of five classes: pure speech, non-pure speech, music, environment sound, and silence. Among them, non-pure speech may further be divided into speech with music and speech with noise. We also describe two methods to select effective and robust audio feature sets. Based on these feature sets, we have evaluated and compared the performance of two kinds of classification frameworks on a testing database that is composed of about 4-hour audio data. The experimental results have shown that the SVM-based method yields high accuracy with high processing speed.
Keywords
audio signal processing; pattern classification; signal classification; speech processing; support vector machines; SVM-based audio scene classification; audio clip classification; audio feature sets; environment sound; music; nonpure speech; pure speech; support vector machine; Acoustic noise; Content based retrieval; Indexing; Layout; Music information retrieval; Noise robustness; Speech enhancement; Support vector machine classification; Support vector machines; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598721
Filename
1598721
Link To Document