Title :
Clustering Frequent Itemsets Based on Generators
Author :
Li, Jinhong ; Yang, Bingru ; Song, Wei ; Hou, Wei
Abstract :
How to reduce the number of frequent itemsets effectively is a hot topic in data mining research. Clustering frequent itemsets is one solution to the problem. Since generators are lossless concise representations of all frequent itemsets, clustering generators is equivalent to clustering all frequent itemsets. This paper proposes a new algorithm for clustering frequent itemsets based on generators. Firstly, based on minimum description length principle, the rationality of clustering generators is discussed. Secondly, the pruning strategies and mining algorithm for generators are proposed. Finally, based on a new similarity criterion of frequent itemsets, the clustering algorithm is presented. Experimental results show that the proposed method can not only reduce the number of discovered itemsets, but also is efficient.
Keywords :
data mining; pattern clustering; clustering frequent itemsets based on generators; clustering generators; data mining research; mining algorithm; pruning strategies; Association rules; Clustering algorithms; Data engineering; Data mining; Educational institutions; Information technology; Itemsets; Noise generators; Noise robustness; Proposals; clustering; data mining; frequent itemset; generator;
Conference_Titel :
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3497-8
DOI :
10.1109/IITA.2008.293