مرکز منطقه ای اطلاع رساني علوم و فناوري - On the complexity of reverse similarity search

DocumentCode :

3334720

Title :

On the complexity of reverse similarity search

Author :

Skala, Matthew

Author_Institution :

David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON

fYear :

2008

fDate :

7-12 April 2008

Firstpage :

436

Lastpage :

443

Abstract :

Two decision problems are presented that arise from reversing the operation of a distance-based indexing tree. Whereas similarity search finds points in the tree given a query point, reverse similarity search begins with a set of constraints like those defining a leaf and generates a point meeting the constraints. These problems derive from robust hashing, a technique used in similarity search and security applications. The problems are analysed for spaces of strings and vectors with a variety of metrics: strings with Hamming distance; the usual (Levenshtein) edit distance; an edit distance we introduce called Superghost distance; arbitrary weighted tree metrics; and real vectors with Minkowski L_P metrics (of which the Euclidean distance is a special case). They are found to inhabit different complexity classes depending on the metric. In particular, the reverse similarity search problem derived from a VP- or GH-tree is NP-complete for any L_P metric except that it is in P for a GH-tree with the Euclidean metric.

Keywords :

computational complexity; database indexing; dynamic programming; query processing; tree data structures; Hamming distance; Levenshtein edit distance; Minkowski Lp metrics; Superghost distance; decision problems; distance-based indexing tree; dynamic programming; query point; reverse similarity search complexity; weighted tree metrics; Binary trees; Computer science; Data structures; Euclidean distance; Extraterrestrial measurements; Hamming distance; Indexing; Robustness; Search problems; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on

Conference_Location :

Cancun

Print_ISBN :

978-1-4244-2161-9

Electronic_ISBN :

978-1-4244-2162-6

Type :

conf

DOI :

10.1109/ICDEW.2008.4498355

Filename :

4498355

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3334720