DocumentCode
2025990
Title
Efficient single-pass frequent itemsets mining over data streams
Author
Tan, Jun ; Bu, Yingyong ; Zhao, Haiming
Author_Institution
Coll. of Comput. & Inf. Eng., Central South Univ. of Forestry & Technol., Changsha, China
Volume
3
fYear
2010
fDate
10-12 Aug. 2010
Firstpage
1438
Lastpage
1441
Abstract
Finding frequent itemsets is one of the most important issues in mining data streams for many applications such as web click stream mining, sensor networks, and network traffic analysis. Most prominent algorithms for traditional transaction databases need multiple scans, therefore, they are not suitable for data streams which are continuous, unbounded, usually come with high speed. In this paper, we propose a new single -pass algorithms which use the FP-tree data structure in combination with the IT-matrix technique which greatly reduces the need to traverse FP-trees. The experiment results on synthetic datasets and real datasets show that our proposed algorithm is an efficient method for mining frequent itemsets over data streams.
Keywords
data mining; tree data structures; FP-tree data structure; IT-matrix technique; Web click stream mining; data stream mining; efficient single-pass frequent itemset mining; network traffic analysis; sensor networks; single -pass algorithms; synthetic datasets; transaction databases; traverse FP-trees; Algorithm design and analysis; Construction industry; Data mining; Data structures; Itemsets; Radiation detectors; Data streams; FP-growth; Frequent itemsets; IT-matrix;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location
Yantai, Shandong
Print_ISBN
978-1-4244-5931-5
Type
conf
DOI
10.1109/FSKD.2010.5569199
Filename
5569199
Link To Document