• DocumentCode
    2961287
  • Title

    Audiovisual event detection towards scene understanding

  • Author

    Canton-Ferrer, C. ; Butko, Taras ; Segura, Carlos ; Giro, X. ; Nadeu, Climent ; Hernando, Juan ; Casas, J.R.

  • Author_Institution
    Tech. Univ. of Catalonia, Barcelona, Spain
  • fYear
    2009
  • fDate
    20-25 June 2009
  • Firstpage
    81
  • Lastpage
    88
  • Abstract
    Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded including manual annotations of the data. A set of metrics allow assessing the performance of the presented algorithms. This dataset is made publicly available for research purposes.
  • Keywords
    audio signal processing; face recognition; motion estimation; object detection; sensor fusion; transforms; video signal processing; acoustic events; audiovisual event detection; face recognition; motion analysis; multi-person tracking; multimodal database; object detection; spectral features; temporal features; weighted mean average; Acoustic signal detection; Cameras; Data mining; Event detection; Feature extraction; Information analysis; Layout; Microphone arrays; Object detection; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    2160-7508
  • Print_ISBN
    978-1-4244-3994-2
  • Type

    conf

  • DOI
    10.1109/CVPRW.2009.5204264
  • Filename
    5204264