Title of article :
Towards Efficient SPARQL Query Processing on RDF Data
Author/Authors :
LIU, Chang Shanghai Jiao Tong University - Department of Computer Science and Engineering, China , WANG, Haofen Shanghai Jiao Tong University - Department of Computer Science and Engineering, China , YU, Yong Shanghai Jiao Tong University - Department of Computer Science and Engineering, China , XU, Linhao IBM China Research Laboratory, China
From page :
613
To page :
622
Abstract :
Efficient support for querying large-scale resource description framework (RDF) triples plays an important role in semantic web data management. This paper presents an efficient RDF query engine for evaluating SPARQL queries, in which an inverted index structure is used to index the RDF triples. A set of operators on the inverted index was developed for query optimization and evaluation. A main-tree-shaped optimization algorithm was then developed that transforms a SPARQL query graph into the optimal query plan by effectively reducing the search space used to determine the optimal join order. The optimization collects a set of RDF statistics for estimating the execution cost of a query plan. Finally, the optimal query plan is evaluated using the defined operators to answer the given SPARQL query. Extensive tests were conducted on both synthetic and real datasets containing up to 100 million triples. The results show that this approach can answer most queries within 1 s and is extremely efficient and scalable in comparison with previous state-of-the-art RDF stores.
Keywords :
resource description framework (RDF) query engine , SPARQL , optimization
Journal title :
Tsinghua Science and Technology
Record number :
2535334