DocumentCode :
2250823
Title :
A load-controllable mining system for frequent-pattern discovery in dynamic data streams
Author :
Jea, Kuen-Fang ; Li, Chao-Wei ; Hsu, Chih-wei ; Lin, Ru-ping ; Yen, Ssu-fan
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung, Taiwan
Volume :
5
fYear :
2010
fDate :
11-14 July 2010
Firstpage :
2466
Lastpage :
2471
Abstract :
In many applications, data-stream sources are prone to dramatic spikes in volume, which necessitates load shedding for data-stream processing systems. In this research, we study the load-shedding problem for frequent-pattern discovery in transactional data streams. A load-controllable mining system with an ε-deficient mining algorithm and three dedicated load-shedding schemes is proposed. When the system is overloaded, a load-shedding scheme is executed to prune a fraction of unprocessed data. From the experimental result, we find that the strategies of load shedding can indeed lighten the system workload while preserving the mining accuracy at an acceptable level.
Keywords :
data mining; ε-deficient mining algorithm; dynamic data stream; frequent-pattern discovery; load shedding; load-controllable mining system; load-shedding scheme; Accuracy; Cybernetics; Data mining; Guidelines; Itemsets; Machine learning; Monitoring; Data mining; Data overload; Data stream; Frequent itemset; Frequent pattern; Load shedding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
Type :
conf
DOI :
10.1109/ICMLC.2010.5580798
Filename :
5580798
Link To Document :
بازگشت