• DocumentCode
    2935809
  • Title

    Multimodal Event Detection in User Generated Videos

  • Author

    Cricri, Francesco ; Dabov, Kostadin ; Curcio, Igor D D ; Mate, Sujeet ; Gabbouj, Moncef

  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2011
  • fDate
    5-7 Dec. 2011
  • Firstpage
    263
  • Lastpage
    270
  • Abstract
    Nowadays most camera-enabled electronic devices contain various auxiliary sensors such as accelerometers, gyroscopes, compasses, GPS receivers, etc. These sensors are often used during the media acquisition to limit camera degradations such as shake and also to provide some basic tagging information such as the location used in geo-tagging. Surprisingly, exploiting the sensor-recordings modality for high-level event detection has been a subject of rather limited research, further constrained to highly specialized acquisition setups. In this work, we show how these sensor modalities, alone or in combination with content-based analysis, allow inferring information about the video content. In addition, we consider a multi-camera scenario, where multiple user generated recordings of a common scene (e.g., music concerts, public events) are available. In order to understand some higher-level semantics of the recorded media, we jointly analyze the individual video recordings and sensor measurements of the multiple users. The detected semantics include generic interesting events and some more specific events. The detection exploits correlations in the camera motion and in the audio content of multiple users. We show that the proposed multimodal analysis methods perform well on various recordings obtained in real live music performances.
  • Keywords
    cameras; object detection; video signal processing; auxiliary sensor; camera degradations; camera-enabled electronic device; content-based analysis; geo-tagging; high-level event detection; higher-level semantics; media acquisition; multicamera scenario; multimodal analysis; multimodal event detection; real live music performance; sensor modalities; tagging information; user generated video; Accelerometers; Cameras; Compass; Correlation; Multimedia communication; Sensors; Videos; Multimodal; analysis; event; indexing; motion; video;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia (ISM), 2011 IEEE International Symposium on
  • Conference_Location
    Dana Point CA
  • Print_ISBN
    978-1-4577-2015-4
  • Type

    conf

  • DOI
    10.1109/ISM.2011.49
  • Filename
    6123356