Title :
A load shedding scheme for frequent pattern mining in transactional data streams
Author :
Kuen-Fang Jea ; Chao-Wei Li ; Chih-Wei Hsu ; Ru-Ping Lin ; Ssu-Fan Yen
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung, Taiwan
Abstract :
In this paper, we study overload handling for frequent-pattern mining in online data streams. For a mining system with an e-deficient synopsis based algorithm, we propose a load shedding scheme to deal with the overload situation. The heavy workload of the mining algorithm lies mostly in the great deal of itemsets which need to be enumerated and counted by the mining algorithm. Therefore, our proposed scheme of load shedding involves the maintenance of a smaller set of itemsets, so the workload can be lessened accordingly. The unrecorded itemsets can be fast approximated for their counts when necessary. According to experimental results, the load shedding scheme can increase the throughput of the mining system and thus help manage the overload problem effectively to a certain extent.
Keywords :
data handling; data mining; e-deficient synopsis based algorithm; load shedding scheme; online data streams; overload handling; pattern mining; transactional data stream; Algorithm design and analysis; Approximation methods; Data mining; Data models; Data structures; Itemsets; Throughput; data mining; data overload; data stream; frequent itemset; load shedding;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
DOI :
10.1109/FSKD.2011.6019707