Title :
CliqueSquare: Flat plans for massively parallel RDF queries
Author :
Goasdoue, Francois ; Kaoudi, Zoi ; Manolescu, Ioana ; Quiane-Ruiz, Jorge-Arnulfo ; Zampetakis, Stamatis
Author_Institution :
Univ. Rennes 1, Rennes, France
Abstract :
As increasing volumes of RDF data are being produced and analyzed, many massively distributed architectures have been proposed for storing and querying this data. These architectures are characterized first, by their RDF partitioning and storage method, and second, by their approach for distributed query optimization, i.e., determining which operations to execute on each node in order to compute the query answers. We present CliqueSquare, a novel optimization approach for evaluating conjunctive RDF queries in a massively parallel environment. We focus on reducing query response time, and thus seek to build flat plans, where the number of joins encountered on a root-to-leaf path in the plan is minimized. We present a family of optimization algorithms, relying on n-ary (star) equality joins to build flat plans, and compare their ability to find the flattest possibles. We have deployed our algorithms in a MapReduce-based RDF platform and demonstrate experimentally the interest of the flat plans built by our best algorithms.
Keywords :
data handling; optimisation; parallel processing; query processing; CliqueSquare; MapReduce-based RDF platform; distributed architectures; optimization approach; parallel RDF queries; Buildings; Distributed databases; Optimization; Parallel processing; Query processing; Resource description framework; Time factors;
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDE.2015.7113332