Title :
A scalable approach to approximating aggregate queries over intermittent streams
Author :
Zhu, Shanzhong ; Ravishankar, Chinya
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., Riverside, CA, USA
Abstract :
We present a novel approach to approximate evaluation of standing aggregate queries over streaming data, subject to user-specified error bounds. Our method models the behavior of aggregates as Brownian motions, and adaptively updates the model according to stream characteristics. This approach has two advantages. First, it greatly improves system scalability since we can defer query evaluation as long as the difference between the returned and true aggregate values remains within user-specified bounds. Second, we are able to provide approximate answers during stream interruptions by estimating the rate at which the streams and the aggregate drift during the blackout periods. We also study processor allocation issues in such approximate aggregate evaluation systems. Our experiments show that our model captures the behavior of real-world streams such as sensor data and stock traces with excellent fidelity, and scales very well for large numbers of standing queries.
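The deferral idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes the aggregate behaves as a driftless Brownian motion with volatility sigma, and uses the reflection principle plus a union bound to pick how long evaluation could be postponed before the aggregate leaves the error bound with probability above delta. The names estimate_sigma, safe_deferral_time, epsilon, and delta are hypothetical and do not come from the paper.

```python
import math
from statistics import NormalDist


def estimate_sigma(times, values):
    """Estimate the volatility of a driftless Brownian motion from samples.

    Assumption: increments over a gap dt have variance sigma^2 * dt, so the
    mean of dx^2 / dt estimates sigma^2. Gaps may be irregular, which is
    the case for an intermittent stream.
    """
    if len(times) != len(values) or len(values) < 2:
        raise ValueError("need at least two (time, value) samples")
    ratios = []
    for i in range(len(values) - 1):
        dt = times[i + 1] - times[i]
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        dx = values[i + 1] - values[i]
        ratios.append(dx * dx / dt)
    return math.sqrt(sum(ratios) / len(ratios))


def safe_deferral_time(epsilon, sigma, delta=0.05):
    """Return a time t for which P(max_{s<=t} |X_s - X_0| >= epsilon) <= delta.

    By the reflection principle, P(max_{s<=t} X_s >= epsilon) equals
    2 * (1 - Phi(epsilon / (sigma * sqrt(t)))); a union bound over both
    directions gives 4 * (1 - Phi(...)). Solving for t yields
    t = (epsilon / (sigma * z))^2 with z = Phi^{-1}(1 - delta / 4).
    """
    if epsilon <= 0 or sigma < 0 or not 0 < delta < 1:
        raise ValueError("require epsilon > 0, sigma >= 0, 0 < delta < 1")
    if sigma == 0:
        return math.inf  # a constant aggregate never leaves the bound
    z = NormalDist().inv_cdf(1 - delta / 4)
    return (epsilon / (sigma * z)) ** 2


if __name__ == "__main__":
    # Hypothetical samples of an aggregate with irregular arrival times.
    ts = [0.0, 1.0, 2.5, 3.0, 5.0]
    xs = [10.0, 10.4, 9.9, 10.1, 10.8]
    sigma = estimate_sigma(ts, xs)
    print("estimated sigma:", sigma)
    print("deferral time for epsilon=1.0:", safe_deferral_time(1.0, sigma))
```

Under these assumptions the deferral time grows with the square of the error bound, so doubling the tolerated error lets a query go four times longer without re-evaluation. The same estimate of sigma could be used to widen the reported error interval during a blackout period.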
Keywords :
Brownian motion; error handling; processor scheduling; query processing; resource allocation; aggregate query approximation; aggregate values; approximate aggregate evaluation systems; blackout periods; intermittent streams; processor allocation; query evaluation; real-world streams; sensor data; stock traces; stream interruptions; streaming data; system scalability; user-specified error bounds; Aggregates; Computer science; Data engineering; Monitoring; Portfolios; Real time systems; Scalability; Temperature sensors; Yarn
Conference_Title :
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-2146-0
DOI :
10.1109/SSDM.2004.1311196