DocumentCode :
2249888
Title :
A graph-based algorithm for frequent closed itemsets mining
Author :
Li, Li ; Zhai, Donghai ; Jin, Fan
Author_Institution :
Sch. of Comput. & Commun. Eng., Southwest Jiaotong Univ., Chengdu, China
fYear :
2003
fDate :
24-25 April 2003
Firstpage :
19
Lastpage :
24
Abstract :
Frequent itemsets mining plays an essential role in data mining, but it often generates a large number of redundant itemsets that reduce the efficiency of the mining task. Frequent closed itemsets are subset of frequent itemsets, but they contain all information of frequent itemsets. The most existing methods of frequent closed itemset mining are apriori-based. The efficiency of those methods is limited to the repeated database scan and the candidate set generation. We propose a graph-based algorithm for mining frequent closed itemsets called GFCG (graph-based frequent closed itemset generation). The new algorithm constructs an association graph to represent the frequent relationship between items, and recursively generates frequent closed itemset based on that graph. It scans the database for only two times, and avoids candidate set generation. GFCG outperforms a priori-based algorithm in experiment study and shows good performance both in speed and scale up properties.
Keywords :
data mining; graph theory; set theory; GFCG; apriori-based algorithm; association graph; candidate set generation; data mining; graph-based algorithm; graph-based frequent closed itemset generation; repeated database scan; Association rules; Data engineering; Data mining; Itemsets; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems and Information Engineering Design Symposium, 2003 IEEE
Print_ISBN :
0-9744559-0-3
Type :
conf
DOI :
10.1109/SIEDS.2003.157999
Filename :
1242394
Link To Document :
بازگشت