DocumentCode
3322837
Title
Flow Algorithms for Parallel Query Optimization
Author
Deshpande, Amol ; Hellerstein, Lisa
Author_Institution
Univ. of Maryland, College Park, MD
fYear
2008
fDate
7-12 April 2008
Firstpage
754
Lastpage
763
Abstract
We address the problem of minimizing the response time of a multi-way join query using pipelined (inter-operator) parallelism, in a parallel or a distributed environment. We observe that in order to fully exploit the parallelism in the system, we must consider a new class of ";interleaving"; plans, where multiple query plans are used simultaneously to minimize the response time of a query (or to maximize the tuple-throughput of the system). We cast the query planning problem in this environment as a ";flow maximization problem";, and present polynomial-time algorithms that (statically) find the optimal set of plans to use for a given query, for a large class of multi-way join queries. Our proposed algorithms also naturally extend to query optimization over web services. Finally we present an extensive experimental evaluation that demonstrates both the need to consider such plans in parallel query processing and the effectiveness of our algorithms.
Keywords
computational complexity; minimisation; parallel databases; pipeline processing; query processing; Web service; distributed environment; flow maximization algorithm; interoperator parallelism; multiway join query response time minimization problem; parallel database; parallel query optimization; pipelined parallelism; polynomial-time algorithm; query planning problem; Casting; Cost function; Databases; Delay; Educational institutions; Interleaved codes; Partitioning algorithms; Pipeline processing; Query processing; Web services;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497484
Filename
4497484
Link To Document