• DocumentCode
    254001
  • Title

    Modeling Image Patches with a Generic Dictionary of Mini-epitomes

  • Author

    Papandreou, George ; Liang-Chieh Chen ; Yuille, Alan L.

  • Author_Institution
    TTI Chicago, Chicago, IL, USA
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    2059
  • Lastpage
    2066
  • Abstract
    The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. Key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an "epitomic footprint" encoding which is easy to visualize and highlights the generative nature of our model. We discuss in detail computational aspects and develop efficient algorithms to make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.
  • Keywords
    image classification; image coding; unsupervised learning; PASCAL VOC 2007 image classification benchmark; SIFT; SIFT-based techniques; bag-of-visual-words setting; categorical visual recognition tasks; epitomic footprint encoding; histogram-based image encoding methods; image collection; image patches modeling; miniepitomes generic dictionary; photometric variability; position variability; unsupervised learning; Computational modeling; Dictionaries; Image coding; Image reconstruction; PSNR; Vectors; Visualization; Image classification; epitomes; image patches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.264
  • Filename
    6909661