DocumentCode
3559000
Title
A Distributed Stream Query Optimization Framework through Integrated Planning and Deployment
Author
Seshadri, Sangeetha ; Kumar, Vibhore ; Cooper, Brian ; Liu, Ling
Author_Institution
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume
20
Issue
10
fYear
2009
Firstpage
1439
Lastpage
1453
Abstract
This paper addresses the problem of optimizing multiple distributed stream queries that are executing simultaneously in distributed data stream systems. We argue that the static query optimization approach of "plan, then deployment" is inadequate for handling distributed queries involving multiple streams and node dynamics faced in distributed data stream systems and applications. Thus, the selection of an optimal execution plan in such dynamic and networked computing systems must consider operator ordering, reuse, network placement, and search space reduction. We propose to use hierarchical network partitions to exploit various opportunities for operator-level reuse while utilizing network characteristics to maintain a manageable search space during query planning and deployment. We develop top-down, bottom-up, and hybrid algorithms for exploiting operator-level reuse through hierarchical network partitions. Formal analysis is presented to establish the bounds on the search space and suboptimality of our algorithms. We have implemented our algorithms in the IFLOW system, an adaptive distributed stream management system. Through simulations and experiments using a prototype deployed on Emulab, we demonstrate the effectiveness of our framework and our algorithms.
Keywords
distributed databases; distributed processing; query processing; Emulab; IFLOW system; adaptive distributed stream management system; database management system; distributed data stream system; distributed stream query optimization framework; formal analysis; hybrid algorithm; integrated planning; network partition; network placement; networked computing system; operator-level reuse; query planning; search space reduction; simulation; static query optimization approach; Computer-communication networks; database management; distributed applications; distributed systems; systems
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
IEEE
Conference_Location
10/17/2008 12:00:00 AM
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2008.232
Filename
4653774