DocumentCode :
1920647
Title :
Optimizing Distributed RDF Triplestores via a Locally Indexed Graph Partitioning
Author :
Wang, Rui ; Chiu, Kenneth
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York at Binghamton, Binghamton, NY, USA
fYear :
2012
fDate :
10-13 Sept. 2012
Firstpage :
259
Lastpage :
268
Abstract :
Semantic web techniques based on RDF are a promising approach to help make sense of the growing deluge in scientific data and conceptually integrate separately administered datasets. However, storing large datasets entirely on a single machine is not scalable, which has led to the concept of distributed triple stores. Existing triple stores, however, fail to take advantage of the non-uniform nature of semantic web data, leading to inefficient data allocation. In this paper, we extend our previous work on uniform graph partitioning to include a local index which is used to filter intermediate results and optimize sub-querying, and a new system design which can better exploit graph partitioned datasets. This also allows us to measure the total communication cost in our simulation. As shown in our experiments, our approach can effectively reduce the communication cost of query-processing messages compared with other approaches, balance the size of partitions compared with other approaches, and enhanced parallelism through independent sub-querying.
Keywords :
data handling; distributed processing; graph theory; indexing; query processing; semantic Web; communication cost reduction; distributed RDF triplestore optimization; graph partitioned datasets; independent subquerying; inefficient data allocation; locally indexed graph partitioning; query-processing messages; semantic Web data; semantic Web techniques; Indexes; Parallel processing; Partitioning algorithms; Query processing; Resource description framework; Resource management; System analysis and design; RDF graph; communication cost; dataset partition; distributed RDF store; parallelism; triplestore;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing (ICPP), 2012 41st International Conference on
Conference_Location :
Pittsburgh, PA
ISSN :
0190-3918
Print_ISBN :
978-1-4673-2508-0
Type :
conf
DOI :
10.1109/ICPP.2012.47
Filename :
6337587
Link To Document :
بازگشت