DocumentCode
2457247
Title
Scalable Multi-query Optimization for SPARQL
Author
Le, Wangchao ; Kementsietsidis, Anastasios ; Duan, Songyun ; Li, Feifei
Author_Institution
Sch. of Comput., Univ. of Utah, Salt Lake City, UT, USA
fYear
2012
fDate
1-5 April 2012
Firstpage
666
Lastpage
677
Abstract
This paper revisits the classical problem of multi-query optimization in the context of RDF/SPARQL. We show that the techniques developed for relational and semi-structured data/query languages are hard, if not impossible, to be extended to account for RDF data model and graph query patterns expressed in SPARQL. In light of the NP-hardness of the multi-query optimization for SPARQL, we propose heuristic algorithms that partition the input batch of queries into groups such that each group of queries can be optimized together. An essential component of the optimization incorporates an efficient algorithm to discover the common sub-structures of multiple SPARQL queries and an effective cost model to compare candidate execution plans. Since our optimization techniques do not make any assumption about the underlying SPARQL query engine, they have the advantage of being portable across different RDF stores. The extensive experimental studies, performed on three popular RDF stores, show that the proposed techniques are effective, efficient and scalable.
Keywords
computational complexity; data models; query languages; query processing; relational databases; NP-hardness; RDF data model; RDF/SPARQL; SPARQL query engine; graph query patterns; relational data/query languages; scalable multiquery optimization; semistructured data/query languages; Buildings; Context; Optimization; Partitioning algorithms; Pattern matching; Resource description framework; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location
Washington, DC
ISSN
1063-6382
Print_ISBN
978-1-4673-0042-1
Type
conf
DOI
10.1109/ICDE.2012.37
Filename
6228123
Link To Document