Title :
A load-controllable mining system for frequent-pattern discovery in dynamic data streams
Author :
Jea, Kuen-Fang ; Li, Chao-Wei ; Hsu, Chih-wei ; Lin, Ru-ping ; Yen, Ssu-fan
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung, Taiwan
Abstract :
In many applications, data-stream sources are prone to dramatic spikes in volume, which necessitates load shedding for data-stream processing systems. In this research, we study the load-shedding problem for frequent-pattern discovery in transactional data streams. A load-controllable mining system with an ε-deficient mining algorithm and three dedicated load-shedding schemes is proposed. When the system is overloaded, a load-shedding scheme is executed to prune a fraction of unprocessed data. From the experimental result, we find that the strategies of load shedding can indeed lighten the system workload while preserving the mining accuracy at an acceptable level.
Keywords :
data mining; ε-deficient mining algorithm; dynamic data stream; frequent-pattern discovery; load shedding; load-controllable mining system; load-shedding scheme; Accuracy; Cybernetics; Data mining; Guidelines; Itemsets; Machine learning; Monitoring; Data mining; Data overload; Data stream; Frequent itemset; Frequent pattern; Load shedding;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
DOI :
10.1109/ICMLC.2010.5580798