DocumentCode :
1896606
Title :
Mining Accurate Top-K Frequent Closed Itemset from Data Stream
Author :
Xiaojun, Cao
Author_Institution :
Inf. Eng. Sch., Lanzhou Univ. of Finance & Econ., Lanzhou, China
Volume :
2
fYear :
2012
fDate :
23-25 March 2012
Firstpage :
180
Lastpage :
184
Abstract :
Frequent Closed Item set mining on data streams is of great significance. Though a minimum support threshold is assumed to be available in classical mining, it is hard to determine it in data streams. Hence, it is more reasonable to ask users to set a bound on the result size. Therefore, a real-time single-pass algorithm, called Top-k frequent closed item sets and a new way of updating the minimum support were proposed for mining top-K closed item sets from data streams efficiently. A novel algorithm, called Can(T), is developed for mining the essential candidate of closed item sets generated so far. Experimental results show that the proposed the algorithm in this paper is an efficient method for mining top-K frequent item sets from data streams.
Keywords :
data mining; Can(T) algorithm; data streams; minimum support threshold; real-time single-pass algorithm; top-k frequent closed item set mining; Algorithm design and analysis; Approximation algorithms; Association rules; Data structures; Itemsets; Lattices; closed frequent itemsets; data streams; top-K;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Electronics Engineering (ICCSEE), 2012 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-0689-8
Type :
conf
DOI :
10.1109/ICCSEE.2012.263
Filename :
6187930
Link To Document :
بازگشت