Title :
Unsupervised Learning of Categorical Segments in Image Collections
Author :
Andreetto, M. ; Zelnik-Manor, L. ; Perona, P.
Author_Institution :
Google Los Angeles (US-LAX-BIN), Venice, CA, USA
Abstract :
Which one comes first: segmentation or recognition? We propose a unified framework for carrying out the two simultaneously and without supervision. The framework combines a flexible probabilistic model, for representing the shape and appearance of each segment, with the popular “bag of visual words” model for recognition. If applied to a collection of images, our framework can simultaneously discover the segments of each image and the correspondence between such segments, without supervision. Such recurring segments may be thought of as the “parts” of corresponding objects that appear multiple times in the image collection. Thus, the model may be used for learning new categories, detecting/classifying objects, and segmenting images, without using expensive human annotation.
Keywords :
image recognition; image representation; image segmentation; object recognition; shape recognition; unsupervised learning; bag of visual words model; categorical segments; flexible probabilistic model; human annotation; image collection; image collections; image recognition; image segmentation; object classification; recurring segments; unsupervised learning; Image recognition; Image segmentation; Pattern analysis; Probabilistic logic; Shape; Visualization; Computer vision; density estimation; graphical models; image segmentation; scene analysis.; unsupervised object recognition; Algorithms; Animals; Artificial Intelligence; Databases, Factual; Face; Humans; Image Processing, Computer-Assisted; Markov Chains; Models, Statistical; Monte Carlo Method; Pattern Recognition, Automated; Trees;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2011.268