• DocumentCode
    2574682
  • Title

    Acoustic topic model for audio information retrieval

  • Author

    Kim, Samuel ; Narayanan, Shrikanth ; Sundaram, Shiva

  • Author_Institution
    Signal Anlaysis & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2009
  • fDate
    18-21 Oct. 2009
  • Firstpage
    37
  • Lastpage
    40
  • Abstract
    A new algorithm for content-based audio information retrieval is introduced in this work. Assuming that there exist hidden acoustic topics and each audio clip is a mixture of those acoustic topics, we proposed a topic model that learns a probability distribution over a set of hidden topics of a given audio clip in an unsupervised manner. We use the Latent Dirichlet Allocation (LDA) method for the topic model, and introduce the notion of acoustic words for supporting modeling within this framework. In audio description classification tasks using Support Vector Machine (SVM) on the BBC database, the proposed acoustic topic model shows promising results by outperforming the Latent Perceptual Indexing (LPI) method in classifying onomatopoeia descriptions and semantic descriptions.
  • Keywords
    audio acoustics; audio signal processing; content-based retrieval; information retrieval; probability; support vector machines; unsupervised learning; audio acoustic; content-based audio information retrieval; latent dirichlet allocation method; latent perceptual indexing method; onomatopoeia description; probability distribution; support vector machine; Acoustic applications; Content based retrieval; Indexing; Information retrieval; Linear discriminant analysis; Music information retrieval; Psychoacoustic models; Signal processing algorithms; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Print_ISBN
    978-1-4244-3678-1
  • Electronic_ISBN
    1931-1168
  • Type

    conf

  • DOI
    10.1109/ASPAA.2009.5346483
  • Filename
    5346483