DocumentCode :
1966820
Title :
Sharing Aggregate Computation of Multiple Group by Queries over Distributed Data Stream
Author :
Wang, Shuang ; Wang, Guoren ; Gao, Xiaoxing
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang
Volume :
4
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
639
Lastpage :
642
Abstract :
Data streaming systems are becoming essential for monitoring applications such as financial analysis, network intrusion detection and sensor network. These systems often have to process multiple similar but different continuous aggregation queries simultaneously. Since executing each query separately can lead to significant scalability and performance problems, it is vital to share resources by exploiting similarities in the queries. The challenge is to identify overlapping computations that may not be obvious in the queries themselves. In this paper, we reveal new opportunities for sharing work in the context of distributed aggregation queries that vary in their group by predicates. We identify settings in which a large set of m such queries can be answered by executing n< m different queries. The n queries are revealed by analyzing the binary two-dimension array capturing the connection among the queries that they satisfy. We propose a novel algorithmic solution for problem of finding the minimum number of queries in such a distributed-streams setting, in order to optimize the communicate cost across the network. The experiment result show that our approach gives us as much as magnitude performance improvement over the no-share settings.
Keywords :
distributed processing; electronic data interchange; query processing; aggregate computation; algorithmic solution; data streaming systems; distributed aggregation queries; distributed data stream; Aggregates; Computer networks; Condition monitoring; Cost function; Distributed computing; Educational institutions; Routing; Scalability; System performance; Wireless sensor networks; aggregation; data stream; distributed system; group by query; multi-query optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.476
Filename :
4722700
Link To Document :
بازگشت