• DocumentCode
    3195159
  • Title

    Unsupervised mining of audiovisually consistent segments in videos with application to structure analysis

  • Author

    Ben, Mathieu ; Gravier, Guillaume

  • Author_Institution
    INRIA Rennes, 35042 Cedex, France
  • fYear
    2011
  • fDate
    11-15 July 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper, a multimodal event mining technique is proposed to discover repeating video segments exhibiting audio and visual consistency in a totally unsupervised manner. The mining strategy first exploits independent audio and visual cluster analysis to provide segments which are consistent in both their visual and audio modalities, thus likely corresponding to a unique underlying event. A subsequent modeling stage using discriminative models enables accurate detection of the underlying event throughout the video. Event mining is applied to unsupervised video structure analysis, using simple heuristics on occurrence patterns of the events discovered to select those relevant to the video structure. Results on TV programs ranging from news to talk shows and games, show that structurally relevant events are discovered with precisions ranging from 87% to 98% and recalls from 59% to 94 %.
  • Keywords
    clustering; content extraction; multimodality; mutual information; video mining; video structuring;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona, Spain
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-61284-348-3
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2011.6011951
  • Filename
    6011951