• DocumentCode
    1920647
  • Title

    Optimizing Distributed RDF Triplestores via a Locally Indexed Graph Partitioning

  • Author

    Wang, Rui ; Chiu, Kenneth

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York at Binghamton, Binghamton, NY, USA
  • fYear
    2012
  • fDate
    10-13 Sept. 2012
  • Firstpage
    259
  • Lastpage
    268
  • Abstract
    Semantic web techniques based on RDF are a promising approach to help make sense of the growing deluge of scientific data and conceptually integrate separately administered datasets. However, storing large datasets entirely on a single machine is not scalable, which has led to the concept of distributed triple stores. Existing triple stores, however, fail to take advantage of the non-uniform nature of semantic web data, leading to inefficient data allocation. In this paper, we extend our previous work on uniform graph partitioning to include a local index, which is used to filter intermediate results and optimize sub-querying, and a new system design that can better exploit graph-partitioned datasets. This also allows us to measure the total communication cost in our simulation. As shown in our experiments, our approach can effectively reduce the communication cost of query-processing messages and balance the size of partitions compared with other approaches, and enhance parallelism through independent sub-querying.
  • Keywords
    data handling; distributed processing; graph theory; indexing; query processing; semantic Web; communication cost reduction; distributed RDF triplestore optimization; graph partitioned datasets; independent subquerying; inefficient data allocation; locally indexed graph partitioning; query-processing messages; semantic Web data; semantic Web techniques; Indexes; Parallel processing; Partitioning algorithms; Query processing; Resource description framework; Resource management; System analysis and design; RDF graph; communication cost; dataset partition; distributed RDF store; parallelism; triplestore
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel Processing (ICPP), 2012 41st International Conference on
  • Conference_Location
    Pittsburgh, PA
  • ISSN
    0190-3918
  • Print_ISBN
    978-1-4673-2508-0
  • Type

    conf

  • DOI
    10.1109/ICPP.2012.47
  • Filename
    6337587