• DocumentCode
    3334637
  • Title

    An empirical evaluation of a distributed clustering-based index for metric space databases

  • Author

    Gil-Costa, Veronica ; Marin, Mauricio ; Reyes, Nora

  • Author_Institution
    Dept. de Inf., Univ. Nac. de San Luis, San Luis
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    386
  • Lastpage
    393
  • Abstract
    Similarity search has been proved suitable for searching in very large collections of unstructured data objects. We are interested in efficient parallel query processing under situations of continuous streams of queries as in search engines. A number of sequential index data structures for this purpose have been proposed so far. This paper focuses on one representative of a class of these data structures, namely one based on clustering for which we evaluate different ways of distributing the index to support parallelism on a set of processors. Our study reveals that the intuitive method for both data distribution and model of computing are not efficient in practice. The best results are obtained with a strategy that appears to be more costly in construction but we show that in practice this cost is not significant.
  • Keywords
    client-server systems; data structures; database indexing; parallel processing; pattern clustering; query processing; very large databases; client-server system; distributed clustering-based index; large databases; metric space databases; parallel query processing; sequential index data structures; similarity search; Data structures; Distributed computing; Distributed databases; Extraterrestrial measurements; Indexes; Nearest neighbor searches; Parallel processing; Query processing; Search engines; Traffic control; BSP; Data Structures; Metric Space; Parallel Search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-2161-9
  • Electronic_ISBN
    978-1-4244-2162-6
  • Type

    conf

  • DOI
    10.1109/ICDEW.2008.4498349
  • Filename
    4498349