• DocumentCode
    3292651
  • Title

    On the Complexity of Reverse Similarity Search

  • Author

    Skala, Matthew

  • Author_Institution
    Univ. of Waterloo, Waterloo
  • fYear
    2008
  • fDate
    11-12 April 2008
  • Firstpage
    149
  • Lastpage
    156
  • Abstract
    Two decision problems are presented that arise from reversing the operation of a distance-based indexing tree. Whereas similarity search finds points in the tree given a query point, reverse similarity search begins with a set of constraints like those defining a leaf and generates a point meeting the constraints. These problems derive from robust hashing, a technique used in similarity search and security applications. The problems are analysed for spaces of strings and vectors with a variety of metrics: strings with Hamming distance; the usual (Levenshtein) edit distance; an edit distance we introduce called Superghost distance; arbitrary weighted tree metrics; and real vectors with Minkowski Lp metrics (of which the Euclidean distance is a special case). They are found to inhabit different complexity classes depending on the metric. In particular, the reverse similarity search problem derived from a VP- or GH-tree is NP-complete for any Lp metric except that it is in P for a GH-tree with the Euclidean metric.
  • Keywords
    computational complexity; database indexing; query processing; search problems; string matching; tree data structures; GH-tree; Hamming distance; Minkowski Lp metrics; Superghost distance; VP-tree; binary strings; binary tree; decision problems; distance-based indexing tree; edit distance; query point; reverse similarity search complexity; weighted tree metrics; Application software; Binary trees; Data structures; Euclidean distance; Extraterrestrial measurements; Hamming distance; Indexing; Robustness; Search problems; Vectors; NP-completeness; edit distance; robust hash; similarity search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Similarity Search and Applications, 2008. SISAP 2008. First International Workshop on
  • Conference_Location
    Belfast
  • Print_ISBN
    0-7695-3101-6
  • Type

    conf

  • DOI
    10.1109/SISAP.2008.16
  • Filename
    4492936