Title :
Searching in metric spaces by spatial approximation
Author :
Navarro, Gonzalo
Author_Institution :
Dept. of Comput. Sci., Chile Univ., Santiago, Chile
Abstract :
We propose a novel data structure to search in metric spaces. A metric space is formed by a collection of objects and a distance function defined among them, which satisfies the triangular inequality. The goal is, given a set of objects and a query, retrieve those objects close enough to the query. The number of distances computed to achieve this goal is the complexity measure. Our data structure, called sa-tree (“spatial approximation tree”), is based on approaching spatially the searched objects. We analyze our method and show that the number of distance evaluations to search among n objects is o(n). We show experimentally that the sa-tree is the best existing technique when the metric space is high-dimensional or the query has low selectivity. These are the most difficult cases in real applications
Keywords :
computational complexity; information retrieval; tree data structures; visual databases; complexity measure; data structure; distance evaluations; distance function; metric space; metric space searching; real applications; sa-tree; searched objects; spatial approximation; spatial approximation tree; triangular inequality; Audio databases; Computer science; Extraterrestrial measurements; Image coding; Image databases; Information retrieval; Machine learning; Performance evaluation; Quantization; Tree data structures;
Conference_Titel :
String Processing and Information Retrieval Symposium, 1999 and International Workshop on Groupware
Conference_Location :
Cancun
Print_ISBN :
0-7695-0268-7
DOI :
10.1109/SPIRE.1999.796589