• DocumentCode
    1134080
  • Title

    Content-based audio classification and retrieval by support vector machines

  • Author

    Guo, Guodong ; Li, Stan Z.

  • Author_Institution
    Comput. Sci. Dept., Univ. of Wisconsin, Madison, WI, USA
  • Volume
    14
  • Issue
    1
  • fYear
    2003
  • fDate
    1/1/2003 12:00:00 AM
  • Firstpage
    209
  • Lastpage
    215
  • Abstract
    Support vector machines (SVMs) have been recently proposed as a new learning algorithm for pattern recognition. In this paper, the SVMs with a binary tree recognition strategy are used to tackle the audio classification problem. We illustrate the potential of SVMs on a common audio database, which consists of 409 sounds of 16 classes. We compare the SVMs based classification with other popular approaches. For audio retrieval, we propose a new metric, called distance-from-boundary (DFB). When a query audio is given, the system first finds a boundary inside which the query pattern is located. Then, all the audio patterns in the database are sorted by their distances to this boundary. All boundaries are learned by the SVMs and stored together with the audio database. Experimental comparisons for audio retrieval are presented to show the superiority of this novel metric to other similarity measures.
  • Keywords
    audio coding; content-based retrieval; learning automata; pattern recognition; binary tree recognition; common audio database; content-based audio classification; content-based audio retrieval; distance-from-boundary; learning algorithm; pattern recognition; similarity measures; support vector machines; Application software; Audio databases; Binary trees; Brightness; Classification tree analysis; Content based retrieval; Neural networks; Pattern recognition; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2002.806626
  • Filename
    1176140