Title :
Mining Approximate Closed Frequent Itemsets over Stream
Author :
Li, Haifeng ; Lu, Zongjian ; Chen, Hong
Author_Institution :
Sch. of Inf., Renmin Univ. of China, Beijing
Abstract :
Frequent itemset mining is a very important problem in data mining. Closed frequent itemsets is the condensed representation of frequent itemsets thus spend less memory, so it is much suitable for stream mining. But on the other hand, when the minimum support is much lower, the size of closed frequent itemsets turns larger, which makes the performance reduced a lot. In this paper, we introduce a threshold to approximately mine closed frequent itemsets with a limited error tolerance. A new algorithm named ACFIM is proposed based on the introduction of the distance conception to mine the sliding window of stream, in which more data are pruned and more computation time are saved, so it much raise the performance in running time and memory comparing to the state-of-art closed frequent itemsets mining methods. Our experimental results over real-life datasets show that ACFIM is effective and efficient.
Keywords :
data mining; ACFIM algorithm; approximate closed frequent itemset; data mining; frequent itemset mining; sliding window; stream mining; Artificial intelligence; Data engineering; Data mining; Distributed computing; Itemsets; Knowledge engineering; Laboratories; Software engineering; Transaction databases; Writing; closed frequent itemset; stream;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3263-9
DOI :
10.1109/SNPD.2008.32