DocumentCode
3627712
Title
Efficient Construction of Compact Shedding Filters for Data Stream Processing
Author
Bugra Gedik;Kun-Lung Wu;Philip S. Yu
Author_Institution
Thomas J. Watson Research Center, IBM Research, 19 Skyline Dr, Hawthorne, NY 10532. bgedik@us.ibm.com
fYear
2008
Firstpage
396
Lastpage
405
Abstract
High-volume source streams, coupled with fluctuating rates, necessitate adaptive load shedding in data stream processing. Without it, a continual query (CQ) server may randomly drop items when its capacity is inadequate to handle the arriving data, degrading the quality of the query results. To alleviate this problem, filters can be used at the source nodes. However, regular source filtering by itself is not sufficient to prevent random dropping, because the amount of data passing through the filters can still surpass the server's capacity. In this case, the source filters can apply intelligent load shedding to minimize the degradation in result quality. In this paper, we introduce a novel type of load-shedding source filter, called the Non-uniformly Regulated (NR) sifter. An NR sifter judiciously applies varying amounts of load shedding to different regions of the data space within the sifter. We formulate the construction of NR sifters as an optimization problem. NR sifters are compact and quickly configurable, allowing frequent adaptations, and provide fast lookup for deciding whether a data item should be dropped. We structure NR sifters as a set of (sifter region, drop threshold) pairs to achieve compactness, develop query consolidation techniques to enable quick construction, and introduce flexible space partitioning mechanisms to realize fast lookup.
Keywords
"Filtering","Matched filters","Adaptive filters","Degradation","Statistics","Information processing","Adaptive systems","Data mining","Stress","Costs"
Publisher
IEEE
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
ISSN
1063-6382
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
2375-026X
Type
conf
DOI
10.1109/ICDE.2008.4497448
Filename
4497448