• DocumentCode
    2288074
  • Title

    Beyond the Euclidean distance: Creating effective visual codebooks using the Histogram Intersection Kernel

  • Author

    Wu, Jianxin ; Rehg, James M.

  • Author_Institution
    Center for Robot. & Intell. Machines, Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2009
  • fDate
    Sept. 29 2009-Oct. 2 2009
  • Firstpage
    630
  • Lastpage
    637
  • Abstract
    Common visual codebook generation methods used in a Bag of Visual words model, e.g. k-means or Gaussian Mixture Model, use the Euclidean distance to cluster features into visual code words. However, most popular visual descriptors are histograms of image measurements. It has been shown that the Histogram Intersection Kernel (HIK) is more effective than the Euclidean distance in supervised learning tasks with histogram features. In this paper, we demonstrate that HIK can also be used in an unsupervised manner to significantly improve the generation of visual codebooks. We propose a histogram kernel k-means algorithm which is easy to implement and runs almost as fast as k-means. The HIK codebook has consistently higher recognition accuracy over k-means codebooks by 2-4%. In addition, we propose a one-class SVM formulation to create more effective visual code words which can achieve even higher accuracy. The proposed method has established new state-of-the-art performance numbers for 3 popular benchmark datasets on object and scene recognition. In addition, we show that the standard k-median clustering method can be used for visual codebook generation and can act as a compromise between HIK and k-means approaches.
  • Keywords
    Gaussian processes; feature extraction; learning (artificial intelligence); pattern clustering; support vector machines; Euclidean distance; Gaussian mixture model; SVM formulation; histogram intersection kernel; k-median clustering method; supervised learning tasks; visual codebook generation methods; Code standards; Computational efficiency; Data mining; Euclidean distance; Histograms; Intelligent robots; Kernel; Layout; Support vector machines; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4244-4420-5
  • Electronic_ISBN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2009.5459178
  • Filename
    5459178