DocumentCode
1920647
Title
Optimizing Distributed RDF Triplestores via a Locally Indexed Graph Partitioning
Author
Wang, Rui ; Chiu, Kenneth
Author_Institution
Dept. of Comput. Sci., State Univ. of New York at Binghamton, Binghamton, NY, USA
fYear
2012
fDate
10-13 Sept. 2012
Firstpage
259
Lastpage
268
Abstract
Semantic web techniques based on RDF are a promising approach to help make sense of the growing deluge in scientific data and conceptually integrate separately administered datasets. However, storing large datasets entirely on a single machine is not scalable, which has led to the concept of distributed triple stores. Existing triple stores, however, fail to take advantage of the non-uniform nature of semantic web data, leading to inefficient data allocation. In this paper, we extend our previous work on uniform graph partitioning to include a local index which is used to filter intermediate results and optimize sub-querying, and a new system design which can better exploit graph partitioned datasets. This also allows us to measure the total communication cost in our simulation. As shown in our experiments, our approach can effectively reduce the communication cost of query-processing messages compared with other approaches, balance the size of partitions compared with other approaches, and enhanced parallelism through independent sub-querying.
Keywords
data handling; distributed processing; graph theory; indexing; query processing; semantic Web; communication cost reduction; distributed RDF triplestore optimization; graph partitioned datasets; independent subquerying; inefficient data allocation; locally indexed graph partitioning; query-processing messages; semantic Web data; semantic Web techniques; Indexes; Parallel processing; Partitioning algorithms; Query processing; Resource description framework; Resource management; System analysis and design; RDF graph; communication cost; dataset partition; distributed RDF store; parallelism; triplestore;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2012 41st International Conference on
Conference_Location
Pittsburgh, PA
ISSN
0190-3918
Print_ISBN
978-1-4673-2508-0
Type
conf
DOI
10.1109/ICPP.2012.47
Filename
6337587
Link To Document