DocumentCode :
3559000
Title :
A Distributed Stream Query Optimization Framework through Integrated Planning and Deployment
Author :
Seshadri, Sangeetha ; Kumar, Vibhore ; Cooper, Brian ; Liu, Ling
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
20
Issue :
10
fYear :
2009
Firstpage :
1439
Lastpage :
1453
Abstract :
This paper addresses the problem of optimizing multiple distributed stream queries that are executing simultaneously in distributed data stream systems. We argue that the static query optimization approach of "plan, then deployment" is inadequate for handling distributed queries involving multiple streams and node dynamics faced in distributed data stream systems and applications. Thus, the selection of an optimal execution plan in such dynamic and networked computing systems must consider operator ordering, reuse, network placement, and search space reduction. We propose to use hierarchical network partitions to exploit various opportunities for operator-level reuse while utilizing network characteristics to maintain a manageable search space during query planning and deployment. We develop top-down, bottom-up, and hybrid algorithms for exploiting operator-level reuse through hierarchical network partitions. Formal analysis is presented to establish the bounds on the search space and suboptimality of our algorithms. We have implemented our algorithms in the IFLOW system, an adaptive distributed stream management system. Through simulations and experiments using a prototype deployed on Emulab, we demonstrate the effectiveness of our framework and our algorithms.
Keywords :
distributed databases; distributed processing; query processing; Emulab; IFLOW system; adaptive distributed stream management system; database management system; distributed data stream system; distributed databases; distributed stream query optimization framework; formal analysis; hybrid algorithm; integrated planning; network partition; network placement; networked computing system; operator-level reuse; query planning; query processing; search space reduction; simulation; static query optimization approach; Computer-communication networks; Distributed Systems; Distributed databases; Query processing; database management; distributed applications; distributed databases; distributed systems; query processing.; systems;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
Conference_Location :
10/17/2008 12:00:00 AM
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2008.232
Filename :
4653774
Link To Document :
بازگشت