Title :
Unsupervised discovery of visual object class hierarchies
Author :
Sivic, Josef ; Russell, Bryan C. ; Zisserman, Andrew ; Freeman, William T. ; Efros, Alexei A.
Author_Institution :
INRIA, Ecole Normale Super., Paris
Abstract :
Objects in the world can be arranged into a hierarchy based on their semantic meaning (e.g. organism - animal - feline - cat). What about defining a hierarchy based on the visual appearance of objects? This paper investigates ways to automatically discover a hierarchical structure for the visual world from a collection of unlabeled images. Previous approaches for unsupervised object and scene discovery focused on partitioning the visual data into a set of non-overlapping classes of equal granularity. In this work, we propose to group visual objects using a multi-layer hierarchy tree that is based on common visual elements. This is achieved by adapting to the visual domain the generative hierarchical latent Dirichlet allocation (hLDA) model previously used for unsupervised discovery of topic hierarchies in text. Images are modeled using quantized local image regions as analogues to words in text. Employing the multiple segmentation framework of Russell et al. [22], we show that meaningful object hierarchies, together with object segmentations, can be automatically learned from unlabeled and unsegmented image collections without supervision. We demonstrate improved object classification and localization performance using hLDA over the previous non-hierarchical method on the MSRC dataset [33].
Keywords :
image coding; image segmentation; learning (artificial intelligence); object detection; object recognition; tree data structures; generative hierarchical latent Dirichlet allocation model; group visual object; multilayer hierarchy tree; multiple object segmentation; quantized local image region; scene discovery; semantic meaning; unlabeled image collection learning; unsupervised object; unsupervised object discovery; visual object appearance; visual object class hierarchy; Animal structures; Image analysis; Image sampling; Image segmentation; Labeling; Layout; Object segmentation; Organisms; Switches; Training data;
Conference_Titel :
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4244-2242-5
Electronic_ISBN :
1063-6919
DOI :
10.1109/CVPR.2008.4587622