• DocumentCode
    3849656
  • Title

    D-Cache: Universal Distance Cache for Metric Access Methods

  • Author

    Tomas Skopal;Jakub Lokoc;Benjamin Bustos

  • Author_Institution
    Charles University, Prague
  • Volume
    24
  • Issue
    5
  • fYear
    2012
  • Firstpage
    868
  • Lastpage
    881
  • Abstract
    The caching of accessed disk pages has been successfully used for decades in database technology, resulting in effective amortization of I/O operations needed within a stream of query or update requests. However, in modern complex databases, like multimedia databases, the I/O cost becomes a minor performance factor. In particular, metric access methods (MAMs), used for similarity search in complex unstructured data, have been designed to minimize rather the number of distance computations than I/O cost (when indexing or querying). Inspired by I/O caching in traditional databases, in this paper we introduce the idea of distance caching for usage with MAMs - a novel approach to streamline similarity search. As a result, we present the D-cache, a main-memory data structure which can be easily implemented into any MAM, in order to spare the distance computations spent by queries/updates. In particular, we have modified two state-of-the-art MAMs to make use of D-cache - the M-tree and Pivot tables. Moreover, we present the D-file, an index-free MAM based on simple sequential search augmented by D-cache. The experimental evaluation shows that performance gain achieved due to D-cache is significant for all the MAMs, especially for the D-file.
  • Keywords
    "Runtime","Measurement","Indexing","Upper bound","Search problems"
  • Journal_Title
    IEEE Transactions on Knowledge and Data Engineering
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.19
  • Filename
    5680908