DocumentCode :
3013497
Title :
Similarity search without tears: the OMNI-family of all-purpose access methods
Author :
Filho, Roberto Figueira Santos ; Traina, Agma ; Traina, Caetano, Jr. ; Faloutsos, Christos
Author_Institution :
Dept. of Comput. Sci. & Stat., Sao Paulo Univ., Brazil
fYear :
2001
fDate :
2001
Firstpage :
623
Lastpage :
630
Abstract :
Designing a new access method inside a commercial DBMS is cumbersome and expensive. We propose a family of metric access methods that are fast and easy to implement on top of existing access methods, such as sequential scan, R-trees and Slim-trees. The idea is to elect a set of objects as foci, and gauge all other objects with their distances from this set. We show how to define the foci set cardinality, how to choose appropriate foci, and how to perform range and nearest-neighbor queries using them, without false dismissals. The foci increase the pruning of distance calculations during the query processing. Furthermore we index the distances from each object to the foci to reduce even triangular inequality comparisons. Experiments on real and synthetic datasets show that our methods match or outperform existing methods. They are up to 10 times faster, and perform up to 10 times fewer distance calculations and disk accesses. In addition, it scales up well, exhibiting sub-linear performance with growing database size
Keywords :
database indexing; query processing; tree data structures; OMNI; R-trees; Slim-trees; database indexing; experiments; foci set cardinality; metric access methods; nearest-neighbor queries; performance; query processing; range queries; sequential scan; similarity search; Computer science; Database systems; Fingerprint recognition; Image databases; Multimedia databases; Multimedia systems; Nearest neighbor searches; Proteins; Query processing; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location :
Heidelberg
ISSN :
1063-6382
Print_ISBN :
0-7695-1001-9
Type :
conf
DOI :
10.1109/ICDE.2001.914877
Filename :
914877
Link To Document :
بازگشت