DocumentCode :
1037509
Title :
DISCOV: A Framework for Discovering Objects in Video
Author :
Liu, David ; Chen, Tsuhan
Author_Institution :
Carnegie Mellon Univ., Pittsburgh
Volume :
10
Issue :
2
fYear :
2008
Firstpage :
200
Lastpage :
208
Abstract :
This paper presents a probabilistic framework for discovering objects in video. The video can switch between different shots, the unknown objects can leave or enter the scene at multiple times, and the background can be cluttered. The framework consists of an appearance model and a motion model. The appearance model exploits the consistency of object parts in appearance across frames. We use maximally stable extremal regions as observations in the model and hence provide robustness to object variations in scale, lighting and viewpoint. The appearance model provides location and scale estimates of the unknown objects through a compact probabilistic representation. The compact representation contains knowledge of the scene at the object level, thus allowing us to augment it with motion information using a motion model. This framework can be applied to a wide range of different videos and object types, and provides a basis for higher level video content analysis tasks. We present applications of video object discovery to video content analysis problems such as video segmentation and threading, and demonstrate superior performance to methods that exploit global image statistics and frequent itemset data mining techniques.
Keywords :
content management; data mining; image motion analysis; image representation; image segmentation; object detection; probability; statistical analysis; unsupervised learning; video signal processing; DISCOV probabilistic framework; compact probabilistic representation; frequent itemset data mining technique; image statistics; unsupervised learning; video appearance model; video content analysis; video motion model; video object discovery; video segmentation; Data mining; Detectors; Humans; Image recognition; Image segmentation; Layout; Object detection; Robustness; Switches; Wheels; Multimedia data mining; unsupervised learning; video object discovery; video segmentation;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2007.911781
Filename :
4432624
Link To Document :
بازگشت