SVM-based audio scene classification

Author

Jiang, Hongchen ; Bai, Junmei ; Zhang, Shuwu ; Xu, Bo

Author_Institution

Inst. of Autom., Chinese Acad. of Sci., Beijing, China

fYear

2005

fDate

30 Oct.-1 Nov. 2005

Firstpage

131

Lastpage

136

Abstract

Audio scene classification is very important in audio indexing, retrieval and video content analysis. In this paper we present our approach that uses support vector machine (SVM) for audio scene classification, which classifies audio clips into one of five classes: pure speech, non-pure speech, music, environment sound, and silence. Among them, non-pure speech may further be divided into speech with music and speech with noise. We also describe two methods to select effective and robust audio feature sets. Based on these feature sets, we have evaluated and compared the performance of two kinds of classification frameworks on a testing database that is composed of about 4-hour audio data. The experimental results have shown that the SVM-based method yields high accuracy with high processing speed.

Keywords

audio signal processing; pattern classification; signal classification; speech processing; support vector machines; SVM-based audio scene classification; audio clip classification; audio feature sets; environment sound; music; nonpure speech; pure speech; support vector machine; Acoustic noise; Content based retrieval; Indexing; Layout; Music information retrieval; Noise robustness; Speech enhancement; Support vector machine classification; Support vector machines; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on

Print_ISBN

0-7803-9361-9

Type

conf

DOI

10.1109/NLPKE.2005.1598721

Filename

1598721