• DocumentCode
    3013497
  • Title

    Similarity search without tears: the OMNI-family of all-purpose access methods

  • Author

    Filho, Roberto Figueira Santos ; Traina, Agma ; Traina, Caetano, Jr. ; Faloutsos, Christos

  • Author_Institution
    Dept. of Comput. Sci. & Stat., Sao Paulo Univ., Brazil
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    623
  • Lastpage
    630
  • Abstract
    Designing a new access method inside a commercial DBMS is cumbersome and expensive. We propose a family of metric access methods that are fast and easy to implement on top of existing access methods, such as sequential scan, R-trees and Slim-trees. The idea is to elect a set of objects as foci, and gauge all other objects with their distances from this set. We show how to define the foci set cardinality, how to choose appropriate foci, and how to perform range and nearest-neighbor queries using them, without false dismissals. The foci increase the pruning of distance calculations during the query processing. Furthermore we index the distances from each object to the foci to reduce even triangular inequality comparisons. Experiments on real and synthetic datasets show that our methods match or outperform existing methods. They are up to 10 times faster, and perform up to 10 times fewer distance calculations and disk accesses. In addition, it scales up well, exhibiting sub-linear performance with growing database size
  • Keywords
    database indexing; query processing; tree data structures; OMNI; R-trees; Slim-trees; database indexing; experiments; foci set cardinality; metric access methods; nearest-neighbor queries; performance; query processing; range queries; sequential scan; similarity search; Computer science; Database systems; Fingerprint recognition; Image databases; Multimedia databases; Multimedia systems; Nearest neighbor searches; Proteins; Query processing; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2001. Proceedings. 17th International Conference on
  • Conference_Location
    Heidelberg
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-1001-9
  • Type

    conf

  • DOI
    10.1109/ICDE.2001.914877
  • Filename
    914877