Title :
A Graph Partitioning Approach to Distributed RDF Stores
Author :
Wang, Rui ; Chiu, Kenneth
Author_Institution :
Comput. Sci. Dept., State Univ. of New York at Binghamton, Binghamton, NY, USA
Abstract :
With growing of Semantic Web data, especially RDF data, managing large RDF dataset on a single machine does not scale well. Previous work has explored how to distribute RDF triples to multiple machines. However due to inefficient dataset partitioning used by these solutions, the performance of distributed store system is significantly affected. In this paper, we proposed a promising approach that utilized the graph nature of RDF datasets to minimize relations between partitions after dataset partitioning, and optimized system design based on it. As shown in our experiments, our approach can effectively reduce communication cost of query-processing messages, balance size of partitions compared with other approaches, and enhance parallelism through independent sub-querying.
Keywords :
data models; distributed processing; graph theory; query processing; semantic Web; storage management; RDF triples; dataset partitioning; distributed RDF stores; distributed store system; graph partitioning; optimized system design; query-processing messages; semantic Web data; Benchmark testing; Distributed databases; Indexes; Parallel processing; Peer to peer computing; Resource description framework; Resource management; RDF graph; communication cost; dataset partition; distributed RDF store; parallelism; triplestore;
Conference_Titel :
Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
Conference_Location :
Leganes
Print_ISBN :
978-1-4673-1631-6
DOI :
10.1109/ISPA.2012.60