DocumentCode :
3334458
Title :
The many facets of approximate similarity search
Author :
Patella, Marco ; Ciaccia, Paolo
Author_Institution :
DEIS, Bologna Univ., Bologna
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
308
Lastpage :
319
Abstract :
In this article, we review the major paradigms for approximate similarity queries and propose a classification schema that allows existing approaches to be easily compared along several independent coordinates. Then, we discuss the impact that the scheduling of index nodes can have on performance and show that, unlike for exact similarity queries, no provably optimal scheduling strategy exists for approximate queries. On the positive side, we show that optimal-on-the-average schedules are well-defined. We conclude by critically reviewing methods for evaluating the quality of approximate results.
Keywords :
pattern classification; query processing; approximate queries; approximate similarity search; classification schema; index nodes; optimal scheduling strategy; Costs; Degradation; Euclidean distance; Extraterrestrial measurements; Information retrieval; Optimal scheduling; Spatial resolution;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2161-9
Electronic_ISBN :
978-1-4244-2162-6
Type :
conf
DOI :
10.1109/ICDEW.2008.4498340
Filename :
4498340