• DocumentCode
    949125
  • Title

    BoostMap: An Embedding Method for Efficient Nearest Neighbor Retrieval

  • Author

    Athitsos, Vassilis ; Alon, Jonathan ; Sclaroff, Stan ; Kollios, George

  • Author_Institution
    Univ. of Texas at Arlington, Arlington
  • Volume
    30
  • Issue
    1
  • fYear
    2008
  • Firstpage
    89
  • Lastpage
    104
  • Abstract
    This paper describes BoostMap, a method for efficient nearest neighbor retrieval under computationally expensive distance measures. Database and query objects are embedded into a vector space in which distances can be measured efficiently. Each embedding is treated as a classifier that predicts for any three objects X, A, B whether X is closer to A or to B. It is shown that a linear combination of such embedding-based classifiers naturally corresponds to an embedding and a distance measure. Based on this property, the BoostMap method reduces the problem of embedding construction to the classical boosting problem of combining many weak classifiers into an optimized strong classifier. The classification accuracy of the resulting strong classifier is a direct measure of the amount of nearest neighbor structure preserved by the embedding. An important property of BoostMap is that the embedding optimization criterion is equally valid in both metric and nonmetric spaces. Performance is evaluated in databases of hand images, handwritten digits, and time series. In all cases, BoostMap significantly improves retrieval efficiency with small losses in accuracy compared to brute-force search. Moreover, BoostMap significantly outperforms existing nearest neighbor retrieval methods such as Lipschitz embeddings, FastMap, and VP-trees.
  • Keywords
    database management systems; pattern classification; query processing; BoostMap embedding method; brute-force search; database objects; database querying; embedding-based classifiers; nearest neighbor retrieval; Indexing methods; embedding methods; multimedia databases; nearest neighbor classification; nearest neighbor retrieval; non-Euclidean spaces; similarity matching; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.1140
  • Filename
    4359304