• DocumentCode
    3317029
  • Title
    SVM-based audio scene classification
  • Author
    Jiang, Hongchen ; Bai, Junmei ; Zhang, Shuwu ; Xu, Bo
  • Author_Institution
    Inst. of Autom., Chinese Acad. of Sci., Beijing, China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    131
  • Lastpage
    136
  • Abstract
    Audio scene classification is important in audio indexing, retrieval, and video content analysis. In this paper we present an approach that uses a support vector machine (SVM) for audio scene classification, assigning each audio clip to one of five classes: pure speech, non-pure speech, music, environment sound, and silence. Non-pure speech may be further divided into speech with music and speech with noise. We also describe two methods for selecting effective and robust audio feature sets. Based on these feature sets, we evaluate and compare the performance of two classification frameworks on a test database of about 4 hours of audio data. The experimental results show that the SVM-based method yields high accuracy with high processing speed.
  • Keywords
    audio signal processing; pattern classification; signal classification; speech processing; support vector machines; SVM-based audio scene classification; audio clip classification; audio feature sets; environment sound; music; nonpure speech; pure speech; support vector machine; Acoustic noise; Content based retrieval; Indexing; Layout; Music information retrieval; Noise robustness; Speech enhancement; Support vector machine classification; Support vector machines; Working environment noise
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type
    conf
  • DOI
    10.1109/NLPKE.2005.1598721
  • Filename
    1598721