• DocumentCode
    1389929
  • Title

    Learning Hybrid Image Templates (HIT) by Information Projection

  • Author

    Si, Zhangzhang ; Zhu, Song-Chun

  • Author_Institution
    Stat. Dept., Univ. of California Los Angeles, Los Angeles, CA, USA
  • Volume
    34
  • Issue
    7
  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    1354
  • Lastpage
    1367
  • Abstract
    This paper presents a novel framework for learning a generative image representation-the hybrid image template (HIT) from a small number (i.e., 3 sim 20) of image examples. Each learned template is composed of, typically, 50 sim 500 image patches whose geometric attributes (location, scale, orientation) may adapt in a local neighborhood for deformation, and whose appearances are characterized, respectively, by four types of descriptors: local sketch (edge or bar), texture gradients with orientations, flatness regions, and colors. These heterogeneous patches are automatically ranked and selected from a large pool according to their information gains using an information projection framework. Intuitively, a patch has a higher information gain if 1) its feature statistics are consistent within the training examples and are distinctive from the statistics of negative examples (i.e., generic images or examples from other categories); and 2) its feature statistics have less intraclass variations. The learning process pursues the most informative (for either generative or discriminative purpose) patches one at a time and stops when the information gain is within statistical fluctuation. The template is associated with a well-normalized probability model that integrates the heterogeneous feature statistics. This automated feature selection procedure allows our algorithm to scale up to a wide range of image categories, from those with regular shapes to those with stochastic texture. The learned representation captures the intrinsic characteristics of the object or scene categories. We evaluate the hybrid image templates on several public benchmarks, and demonstrate classification performances on par with state-of-the-art methods like HoG+SVM, and when small training sample sizes are used, the proposed system shows a clear advantage.
  • Keywords
    feature extraction; gradient methods; image representation; image texture; learning (artificial intelligence); probability; statistical analysis; stochastic processes; HIT; HoG; SVM; feature selection procedure; flatness region; geometric attribute; heterogeneous feature statistics; heterogeneous image patch; hybrid image template; image representation; information gain; information projection framework; learning process; local sketch; normalized probability model; stochastic texture; texture gradient; Deformable models; Histograms; Image color analysis; Lattices; Prototypes; Shape; Support vector machines; Image representation; deformable templates; information projection; statistical modeling.; visual learning;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2011.227
  • Filename
    6095562