Title :
Efficient Load Shedding for Streaming Sliding Window Joins
Author :
Ren, Jia-dong ; Jiang, Wan-chang ; Huo, Cong
Author_Institution :
YanShan Univ., Qinhuangdao
Abstract :
We present a novel load shedding technique, called range loading shedding (denoted as RANGE), for sliding window joins when CPU capacity is insufficient in the system and the details of the distribution of streams are unknown. To obtain the statistics of data, we dynamically maintain clustering range histogram (CR-histogram) and average density counter table (ADC-table) for each sliding window. The CR-Histogram is constructed and maintained by clustering technique with a fixed amount of memory. When CPU capacity is insufficient, the RANGE technique is used to select tuples to be processed by utilizing the CR-Histogram and ADC-table, and then produces maximum subset join outputs. Experimental results on synthetic and real life data show that Range load shedding approach obtains max-subset results effectively, and outperforms the existing load shedding strategies.
Keywords :
data analysis; load shedding; pattern clustering; query processing; average density counter table; clustering range histogram; maximum subset join outputs; range loading shedding; sliding window joins; Counting circuits; Cybernetics; Data analysis; Educational institutions; Histograms; Information science; Internet; Machine learning; Query processing; Statistical distributions; Clustering technique; Data streams; Histogram; Load shedding; Sliding window join;
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370389