• DocumentCode
    3601371
  • Title

    Supervised Hashing Using Graph Cuts and Boosted Decision Trees

  • Author

    Guosheng Lin ; Chunhua Shen ; van den Hengel, Anton

  • Author_Institution
    Australian Res. Council Centre of Excellence for Robotic Vision, Univ. of Adelaide, Adelaide, SA, Australia
  • Volume
    37
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2317
  • Lastpage
    2331
  • Abstract
    To build large-scale query-by-example image retrieval systems, embedding image features into a binary Hamming space provides great benefits. Supervised hashing aims to map the original features to compact binary codes that are able to preserve label based similarity in the binary Hamming space. Most existing approaches apply a single form of hash function, and an optimization process which is typically deeply coupled to this specific form. This tight coupling restricts the flexibility of those methods, and can result in complex optimization problems that are difficult to solve. In this work we proffer a flexible yet simple framework that is able to accommodate different types of loss functions and hash functions. The proposed framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods. Our framework decomposes the hashing learning problem into two steps: binary code (hash bit) learning and hash function learning. The first step can typically be formulated as binary quadratic problems, and the second step can be accomplished by training a standard binary classifier. For solving large-scale binary code inference, we show how it is possible to ensure that the binary quadratic problems are submodular such that efficient graph cut methods may be used. To achieve efficiency as well as efficacy on large-scale high-dimensional data, we propose to use boosted decision trees as the hash functions, which are nonlinear, highly descriptive, and are very fast to train and evaluate. Experiments demonstrate that the proposed method significantly outperforms most state-of-the-art methods, especially on high-dimensional data.
  • Keywords
    Hamming codes; binary codes; decision trees; file organisation; graph theory; image retrieval; learning (artificial intelligence); optimisation; pattern classification; binary Hamming space; binary classifier; binary code inference; binary code learning; binary quadratic problems; boosted decision trees; compact binary codes; graph cut methods; hash bit learning; hash function learning; hashing learning problem; label based similarity; optimization process; query-by-example image retrieval systems; supervised hashing; Binary codes; Decision trees; Hamming distance; Inference algorithms; Kernel; Optimization; Training; Binary Codes; Decision Trees; Graph Cuts; Hashing; Image Retrieval; Nearest Neighbour Search; binary codes; decision trees; graph cuts; image retrieval; nearest neighbour search;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2015.2404776
  • Filename
    7044591