DocumentCode
3334458
Title
The many facets of approximate similarity search
Author
Patella, Marco ; Ciaccia, Paolo
Author_Institution
DEIS, Bologna Univ., Bologna
fYear
2008
fDate
7-12 April 2008
Firstpage
308
Lastpage
319
Abstract
In this article, we review the major paradigms for approximate similarity queries and propose a classification schema that easily allows existing approaches to be compared along several independent coordinates. Then, we discuss the impact that scheduling of index nodes can have on performance and show that, unlike exact similarity queries, no provable optimal scheduling strategy exists for approximate queries. On the positive side, we show that optimal- on-the-average schedules are well-defined. We complete by critically reviewing methods for evaluating the quality of approximate results.
Keywords
pattern classification; query processing; approximate queries; approximate similarity search; classification schema; index nodes; optimal scheduling strategy; Costs; Degradation; Euclidean distance; Extraterrestrial measurements; Information retrieval; Optimal scheduling; Spatial resolution;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-2161-9
Electronic_ISBN
978-1-4244-2162-6
Type
conf
DOI
10.1109/ICDEW.2008.4498340
Filename
4498340
Link To Document