• DocumentCode
    2292866
  • Title

    Hierarchical Gaussianization for image classification

  • Author

    Zhou, Xi ; Cui, Na ; Li, Zhen ; Liang, Feng ; Huang, Thomas S.

  • Author_Institution
    Dept. of ECE, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2009
  • fDate
    Sept. 29 2009-Oct. 2 2009
  • Firstpage
    1971
  • Lastpage
    1977
  • Abstract
    In this paper, we propose a new image representation to capture both the appearance and spatial information for image classification applications. First, we model the feature vectors, from the whole corpus, from each image and at each individual patch, in a Bayesian hierarchical framework using mixtures of Gaussians. After such a hierarchical Gaussianization, each image is represented by a Gaussian mixture model (GMM) for its appearance, and several Gaussian maps for its spatial layout. Then we extract the appearance information from the GMM parameters, and the spatial information from global and local statistics over Gaussian maps. Finally, we employ a supervised dimension reduction technique called DAP (discriminant attribute projection) to remove noise directions and to further enhance the discriminating power of our representation. We justify that the traditional histogram representation and the spatial pyramid matching are special cases of our hierarchical Gaussianization. We compare our new representation with other approaches in scene classification, object recognition and face recognition, and our performance ranks among the top in all three tasks.
  • Keywords
    Gaussian processes; face recognition; image classification; image representation; object recognition; Bayesian hierarchical framework; Gaussian maps; Gaussian mixture model; discriminant attribute projection; face recognition; feature vectors; hierarchical Gaussianization; image classification applications; image representation; object recognition; spatial information; spatial pyramid matching; supervised dimension reduction technique; traditional histogram representation; Bayesian methods; Data mining; Digital audio players; Gaussian processes; Histograms; Image classification; Image representation; Layout; Noise reduction; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4244-4420-5
  • Electronic_ISBN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2009.5459435
  • Filename
    5459435