Title :
Optimization of Distributed SPARQL Queries Using Edmonds´ Algorithm and Prim´s Algorithm
Author :
Vandervalk, Benjamin P. ; McCarthy, E. Luke ; Wilkinson, Mark D.
Author_Institution :
James Hogg iCAPTURE Centre for Cardiovascular, Univ. of British Columbia, Vancouver, BC, Canada
Abstract :
Here we describe the application of two well known graph algorithms, Edmonds´ algorithm and Prim´s algorithm, to the problem of optimizing distributed SPARQL queries. In the context of this paper, a ldquodistributed SPARQL queryrdquo is a SPARQL query which is resolved by contacting any number of remote SPARQL endpoints. Two optimization approaches are described. In the first approach, a static query plan is computed in advance of query execution, using one of two standard graph algorithms for finding minimum spanning trees (Edmonds´ algorithm and Prim´s algorithm). In the second approach, the planning and execution of the query are interleaved, so that as each potential solution is expanded it is permitted to follow an independent query plan. Our optimization approach requires basic statistics regarding RDF predicates which must be obtained prior to the user´s query, through automated querying of the remote SPARQL endpoints.
Keywords :
distributed algorithms; query languages; query processing; Edmonds algorithm; Prim algorithm; automated querying; distributed SPARQL query; graph algorithm; spanning trees; static query plan; Cardiology; Database languages; Distributed computing; Engines; Heart; Lungs; Query processing; Resource description framework; Statistical distributions; Tree graphs; distributed; semantic web; sparql;
Conference_Titel :
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-5334-4
Electronic_ISBN :
978-0-7695-3823-5
DOI :
10.1109/CSE.2009.144