DocumentCode
2935809
Title
Multimodal Event Detection in User Generated Videos
Author
Cricri, Francesco ; Dabov, Kostadin ; Curcio, Igor D D ; Mate, Sujeet ; Gabbouj, Moncef
Author_Institution
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear
2011
fDate
5-7 Dec. 2011
Firstpage
263
Lastpage
270
Abstract
Nowadays most camera-enabled electronic devices contain various auxiliary sensors such as accelerometers, gyroscopes, compasses, and GPS receivers. These sensors are often used during media acquisition to limit camera degradations such as shake and to provide basic tagging information, such as the location used in geo-tagging. Surprisingly, exploiting the sensor-recording modality for high-level event detection has been the subject of rather limited research, which has moreover been constrained to highly specialized acquisition setups. In this work, we show how these sensor modalities, alone or in combination with content-based analysis, make it possible to infer information about the video content. In addition, we consider a multi-camera scenario, where multiple user-generated recordings of a common scene (e.g., music concerts, public events) are available. To understand some higher-level semantics of the recorded media, we jointly analyze the individual video recordings and sensor measurements of the multiple users. The detected semantics include generic interesting events and some more specific events. The detection exploits correlations in the camera motion and in the audio content of multiple users. We show that the proposed multimodal analysis methods perform well on various recordings made at real live music performances.
Keywords
cameras; object detection; video signal processing; auxiliary sensor; camera degradations; camera-enabled electronic device; content-based analysis; geo-tagging; high-level event detection; higher-level semantics; media acquisition; multicamera scenario; multimodal analysis; multimodal event detection; real live music performance; sensor modalities; tagging information; user generated video; Accelerometers; Cameras; Compass; Correlation; Multimedia communication; Sensors; Videos; Multimodal; analysis; event; indexing; motion; video;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia (ISM), 2011 IEEE International Symposium on
Conference_Location
Dana Point, CA
Print_ISBN
978-1-4577-2015-4
Type
conf
DOI
10.1109/ISM.2011.49
Filename
6123356