Title :
Mining Recent Frequent Itemsets in Data Streams
Author :
Li, Kun ; Wang, Yong-yan ; Ellahi, Manzoor ; Wang, Hong-an
Author_Institution :
Intell. Eng. Lab., Inst. of Software Chinese Acad. of Sci., Beijing
Abstract :
Mining frequent itemsets in data streams is a hot research topic in recent years. Due to the continuous, high-speed and unbounded properties of data streams, traditional algorithms on static dataset are not suitable for mining in data streams. In this paper we present bounded frequent itemsets stream (abbreviated as BFI-stream) algorithm, which uses a prefix-tree based structure, called BFI-tree, to maintain all accurate frequent itemsets from sliding windows over data streams. By monitoring the boundary between frequent itemsets and infrequent itemsets, it restricts the update process on a small part of the tree. Mining all frequent itemsets with accurate frequencies is just to traverse the tree. It is time efficient even when the user-specified minimum support threshold is small. Experiments compare the time and space usage with MFI-TransSW, which also returns all accurate frequent itemsets from sliding windows. The results show that BFI-stream outperforms MFI-TransSW in both time and space at most time especially when the minimum support is small.
Keywords :
data mining; tree data structures; BFI-stream; MFI-TransSW; bounded frequent itemsets stream; data streams; prefix-tree based structure; Data engineering; Data mining; Delay; Financial management; Frequency; Fuzzy systems; Itemsets; Knowledge engineering; Monitoring; Telecommunication network management; data mining; data stream; frequent itemset;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Jinan Shandong
Print_ISBN :
978-0-7695-3305-6
DOI :
10.1109/FSKD.2008.255