DocumentCode
254001
Title
Modeling Image Patches with a Generic Dictionary of Mini-epitomes
Author
Papandreou, George ; Liang-Chieh Chen ; Yuille, Alan L.
Author_Institution
TTI Chicago, Chicago, IL, USA
fYear
2014
fDate
23-28 June 2014
Firstpage
2059
Lastpage
2066
Abstract
The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. Key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an "epitomic footprint" encoding which is easy to visualize and highlights the generative nature of our model. We discuss in detail computational aspects and develop efficient algorithms to make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.
Keywords
image classification; image coding; unsupervised learning; PASCAL VOC 2007 image classification benchmark; SIFT; SIFT-based techniques; bag-of-visual-words setting; categorical visual recognition tasks; epitomic footprint encoding; histogram-based image encoding methods; image collection; image patches modeling; miniepitomes generic dictionary; photometric variability; position variability; unsupervised learning; Computational modeling; Dictionaries; Image coding; Image reconstruction; PSNR; Vectors; Visualization; Image classification; epitomes; image patches;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location
Columbus, OH
Type
conf
DOI
10.1109/CVPR.2014.264
Filename
6909661
Link To Document