• DocumentCode
    2805476
  • Title

    Detecting local semantic concepts in environmental sounds using Markov model based clustering

  • Author

    Lee, Keansub ; Ellis, Daniel P W ; Loui, Alexander C.

  • Author_Institution
    Dept. of Elec. Eng., Columbia Univ., New York, NY, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    2278
  • Lastpage
    2281
  • Abstract
    Detecting the time of occurrence of an acoustic event (for instance, a cheer) embedded in a longer soundtrack is useful and important for applications such as search and retrieval in consumer video archives. We present a Markov-model based clustering algorithm able to identify and segment consistent sets of temporal frames into regions associated with different ground-truth labels, and simultaneously to exclude a set of uninformative frames shared in common from all clips. The labels are provided at the clip level, so this refinement of the time axis represents a variant of Multiple-Instance Learning (MIL). Evaluation shows that local concepts are effectively detected by this clustering technique based on coarse-scale labels, and that detection performance is significantly better than existing algorithms for classifying real-world consumer recordings.
  • Keywords
    acoustic signal detection; hidden Markov models; video retrieval; video signal processing; Markov clustering model; acoustic detection; casual recordings; consumer recordings; environmental sounds; multiple-instance learning; soundtrack; time axis refinement; video clips; video retrieval; Acoustic applications; Acoustic signal detection; Audio recording; Cameras; Clustering algorithms; Data mining; Event detection; Video recording; Video sharing; YouTube; Audio Segmentation; Environmental Audio; Markov Models; Multiple Instance Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495915
  • Filename
    5495915