DocumentCode
3013497
Title
Similarity search without tears: the OMNI-family of all-purpose access methods
Author
Filho, Roberto Figueira Santos ; Traina, Agma ; Traina, Caetano, Jr. ; Faloutsos, Christos
Author_Institution
Dept. of Comput. Sci. & Stat., Sao Paulo Univ., Brazil
fYear
2001
fDate
2001
Firstpage
623
Lastpage
630
Abstract
Designing a new access method inside a commercial DBMS is cumbersome and expensive. We propose a family of metric access methods that are fast and easy to implement on top of existing access methods, such as sequential scan, R-trees and Slim-trees. The idea is to elect a set of objects as foci, and gauge all other objects with their distances from this set. We show how to define the foci set cardinality, how to choose appropriate foci, and how to perform range and nearest-neighbor queries using them, without false dismissals. The foci increase the pruning of distance calculations during the query processing. Furthermore we index the distances from each object to the foci to reduce even triangular inequality comparisons. Experiments on real and synthetic datasets show that our methods match or outperform existing methods. They are up to 10 times faster, and perform up to 10 times fewer distance calculations and disk accesses. In addition, it scales up well, exhibiting sub-linear performance with growing database size
Keywords
database indexing; query processing; tree data structures; OMNI; R-trees; Slim-trees; database indexing; experiments; foci set cardinality; metric access methods; nearest-neighbor queries; performance; query processing; range queries; sequential scan; similarity search; Computer science; Database systems; Fingerprint recognition; Image databases; Multimedia databases; Multimedia systems; Nearest neighbor searches; Proteins; Query processing; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location
Heidelberg
ISSN
1063-6382
Print_ISBN
0-7695-1001-9
Type
conf
DOI
10.1109/ICDE.2001.914877
Filename
914877
Link To Document