Title :
Detecting local semantic concepts in environmental sounds using Markov model based clustering
Author :
Lee, Keansub ; Ellis, Daniel P W ; Loui, Alexander C.
Author_Institution :
Dept. of Elec. Eng., Columbia Univ., New York, NY, USA
Abstract :
Detecting the time of occurrence of an acoustic event (for instance, a cheer) embedded in a longer soundtrack is useful and important for applications such as search and retrieval in consumer video archives. We present a Markov-model based clustering algorithm able to identify and segment consistent sets of temporal frames into regions associated with different ground-truth labels, and simultaneously to exclude a set of uninformative frames shared in common from all clips. The labels are provided at the clip level, so this refinement of the time axis represents a variant of Multiple-Instance Learning (MIL). Evaluation shows that local concepts are effectively detected by this clustering technique based on coarse-scale labels, and that detection performance is significantly better than existing algorithms for classifying real-world consumer recordings.
Keywords :
acoustic signal detection; hidden Markov models; video retrieval; video signal processing; Markov clustering model; acoustic detection; casual recordings; consumer recordings; environmental sounds; multiple-instance learning; soundtrack; time axis refinement; video clips; video retrieval; Acoustic applications; Acoustic signal detection; Audio recording; Cameras; Clustering algorithms; Data mining; Event detection; Video recording; Video sharing; YouTube; Audio Segmentation; Environmental Audio; Markov Models; Multiple Instance Learning;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495915