DocumentCode
3289927
Title
Mining Recent Frequent Itemsets in Data Streams
Author
Li, Kun ; Wang, Yong-yan ; Ellahi, Manzoor ; Wang, Hong-an
Author_Institution
Intell. Eng. Lab., Inst. of Software Chinese Acad. of Sci., Beijing
Volume
4
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
353
Lastpage
358
Abstract
Mining frequent itemsets in data streams is a hot research topic in recent years. Due to the continuous, high-speed and unbounded properties of data streams, traditional algorithms on static dataset are not suitable for mining in data streams. In this paper we present bounded frequent itemsets stream (abbreviated as BFI-stream) algorithm, which uses a prefix-tree based structure, called BFI-tree, to maintain all accurate frequent itemsets from sliding windows over data streams. By monitoring the boundary between frequent itemsets and infrequent itemsets, it restricts the update process on a small part of the tree. Mining all frequent itemsets with accurate frequencies is just to traverse the tree. It is time efficient even when the user-specified minimum support threshold is small. Experiments compare the time and space usage with MFI-TransSW, which also returns all accurate frequent itemsets from sliding windows. The results show that BFI-stream outperforms MFI-TransSW in both time and space at most time especially when the minimum support is small.
Keywords
data mining; tree data structures; BFI-stream; MFI-TransSW; bounded frequent itemsets stream; data streams; prefix-tree based structure; Data engineering; Data mining; Delay; Financial management; Frequency; Fuzzy systems; Itemsets; Knowledge engineering; Monitoring; Telecommunication network management; data mining; data stream; frequent itemset;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location
Jinan Shandong
Print_ISBN
978-0-7695-3305-6
Type
conf
DOI
10.1109/FSKD.2008.255
Filename
4666411
Link To Document