• DocumentCode
    1037509
  • Title

    DISCOV: A Framework for Discovering Objects in Video

  • Author

    Liu, David ; Chen, Tsuhan

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh
  • Volume
    10
  • Issue
    2
  • fYear
    2008
  • Firstpage
    200
  • Lastpage
    208
  • Abstract
    This paper presents a probabilistic framework for discovering objects in video. The video can switch between different shots, the unknown objects can leave or enter the scene at multiple times, and the background can be cluttered. The framework consists of an appearance model and a motion model. The appearance model exploits the consistency of object parts in appearance across frames. We use maximally stable extremal regions as observations in the model and hence provide robustness to object variations in scale, lighting and viewpoint. The appearance model provides location and scale estimates of the unknown objects through a compact probabilistic representation. The compact representation contains knowledge of the scene at the object level, thus allowing us to augment it with motion information using a motion model. This framework can be applied to a wide range of different videos and object types, and provides a basis for higher level video content analysis tasks. We present applications of video object discovery to video content analysis problems such as video segmentation and threading, and demonstrate superior performance to methods that exploit global image statistics and frequent itemset data mining techniques.
  • Keywords
    content management; data mining; image motion analysis; image representation; image segmentation; object detection; probability; statistical analysis; unsupervised learning; video signal processing; DISCOV probabilistic framework; compact probabilistic representation; frequent itemset data mining technique; image statistics; unsupervised learning; video appearance model; video content analysis; video motion model; video object discovery; video segmentation; Data mining; Detectors; Humans; Image recognition; Image segmentation; Layout; Object detection; Robustness; Switches; Wheels; Multimedia data mining; unsupervised learning; video object discovery; video segmentation;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2007.911781
  • Filename
    4432624