Title :
Size Bounds and Query Plans for Relational Joins
Author :
Atserias, Albert ; Grohe, Martin ; Marx, Dániel
Author_Institution :
Univ. Politec. de Catalunya, Barcelona
Abstract :
Relational joins are at the core of relational algebra, which in turn is the core of the standard database query language SQL. As their evaluation is expensive and very often dominated by the output size, it is an important task for database query optimisers to compute estimates of the size of joins and to find good execution plans for sequences of joins. We study these problems from a theoretical perspective, both in the worst-case model and in an average-case model where the database is chosen according to a known probability distribution. In the former case, our first key observation is that the worst-case size of a query is characterised by the fractional edge cover number of its underlying hypergraph, a combinatorial parameter previously known to provide an upper bound. We complete the picture by proving a matching lower bound, and by showing that there exist queries for which the join-project plan suggested by the fractional edge cover approach may be substantially better than any join plan that does not use intermediate projections.
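Illustrative sketch (not taken from the paper): the fractional edge cover bound mentioned in the abstract is usually stated as follows. If each relation R_j receives a weight x_j >= 0 such that every attribute is covered with total weight at least 1, then the join has at most prod_j |R_j|^(x_j) tuples. The Python snippet below checks this on the triangle query R(a,b), S(b,c), T(a,c) with weights (1/2, 1/2, 1/2). The helper names, the tiny instance, and the brute-force join are assumptions chosen for illustration, and the weights are supplied by hand rather than computed by a linear program.

```python
from itertools import product
from math import prod


def is_fractional_edge_cover(attrs, edges, weights, tol=1e-9):
    """Check that every attribute gets total weight >= 1 from the relations containing it."""
    return all(
        sum(w for e, w in zip(edges, weights) if a in e) >= 1 - tol
        for a in attrs
    )


def size_bound(sizes, weights):
    """Upper bound on the join size: product of |R_j| ** x_j over all relations."""
    return prod(n ** w for n, w in zip(sizes, weights))


def natural_join(attrs, edges, relations):
    """Brute-force natural join over the active domain (only suitable for tiny inputs)."""
    domain = sorted({v for rel in relations for t in rel for v in t})
    out = []
    for values in product(domain, repeat=len(attrs)):
        row = dict(zip(attrs, values))
        if all(tuple(row[a] for a in e) in rel for e, rel in zip(edges, relations)):
            out.append(values)
    return out


# Hypothetical instance: triangle query with three complete binary relations.
attrs = ("a", "b", "c")
edges = [("a", "b"), ("b", "c"), ("a", "c")]
m = 3
full = {(i, j) for i in range(m) for j in range(m)}
relations = [full, full, full]
weights = [0.5, 0.5, 0.5]  # hand-picked fractional edge cover (assumption)

assert is_fractional_edge_cover(attrs, edges, weights)
bound = size_bound([len(r) for r in relations], weights)
result = natural_join(attrs, edges, relations)
print(len(result), bound)
```

On this instance each relation has 9 tuples, so the bound is 9^(3/2) = 27, and the join returns all 27 triples: the bound is met exactly, which is the kind of tightness the abstract's matching lower bound refers to.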
Keywords :
graph theory; probability; query languages; relational algebra; SQL; average-case model; combinatorial parameter; database query language; fractional edge cover approach; hypergraph; probability distribution; query plan; relational joins; size bound; worst-case model; Algebra; Computer science; Cost function; Database languages; Database systems; Engines; Probability distribution; Query processing; Relational databases; Upper bound
Conference_Title :
49th Annual IEEE Symposium on Foundations of Computer Science (FOCS '08), 2008
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3436-7
DOI :
10.1109/FOCS.2008.43