DocumentCode :
3421044
Title :
From Subcategories to Visual Composites: A Multi-level Framework for Object Detection
Author :
Tian Lan ; Raptis, Michalis ; Sigal, Leonid ; Mori, Greg
Author_Institution :
Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
369
Lastpage :
376
Abstract :
The appearance of an object changes profoundly with pose, camera view and interactions of the object with other objects in the scene. This makes it challenging to learn detectors based on an object-level label (e.g., "car"). We postulate that having a richer set of labelings (at different levels of granularity) for an object, including finer-grained subcategories, consistent in appearance and view, and higher order composites - contextual groupings of objects consistent in their spatial layout and appearance, can significantly alleviate these problems. However, obtaining such a rich set of annotations, including annotation of an exponentially growing set of object groupings, is simply not feasible. We propose a weakly-supervised framework for object detection where we discover subcategories and the composites automatically with only traditional object-level category labels as input. To this end, we first propose an exemplar-SVM-based clustering approach, with latent SVM refinement, that discovers a variable length set of discriminative subcategories for each object class. We then develop a structured model for object detection that captures interactions among object subcategories and automatically discovers semantically meaningful and discriminatively relevant visual composites. We show that this model produces state-of-the-art performance on UIUC phrase object detection benchmark.
Keywords :
object detection; pattern clustering; support vector machines; UIUC phrase object detection benchmark; camera view; discriminative subcategories; exemplar-SVM-based clustering approach; finer-grained subcategories; granularity level; higher-order composites; latent SVM refinement; multilevel framework; object appearance; object class; object contextual groupings; object subcategories; object-level category label; pose; spatial layout; structured model; variable length set; visual composites; weakly-supervised framework; Context modeling; Detectors; Labeling; Object detection; Support vector machines; Training; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.53
Filename :
6751155
Link To Document :
بازگشت