DocumentCode :
2867147
Title :
An Algorithm of Mining Long Frequent Itemsets Based on Digit Sequence
Author :
Fang, Gang ; Xiong, Jiang ; Tu, Cheng-Sheng ; Wu, Yuan-bin
Author_Institution :
Coll. of Math & Comput. Sci., Chongqing Three Gorges Univ., Chongqing, China
fYear :
2009
fDate :
19-20 Dec. 2009
Firstpage :
1
Lastpage :
4
Abstract :
In order to fast generate candidate frequent itemsets, avoid redundant calculation and reduce the time of scanning database, this paper proposes an algorithm of mining frequent itemsets based on digit sequence, which is suitable for mining long frequent itemsets. The algorithm firstly turns all transactions into digital transactions by binary, and then computing digit sequence of every attribute item. Finally, the algorithm uses the method of forming digital pure subset of digital transaction to generate candidate frequent itemsets by declining the value of digital pure subset, and uses the method of computing dimension of digit sequence of attribute items to compute support of candidate frequent itemsets, this method is used to reduce the time of scanning transactions database. The algorithm only scans once database when mining all these association rules, which is different from presented algorithms of mining long frequent itemsets. The experiment indicates that the efficiency of the algorithm is faster and more efficient than presented algorithms of association rules mining.
Keywords :
data mining; deductive databases; association rules mining; digit sequence; digital pure subset method; frequent itemsets mining; scanning transactions database; Association rules; Binary codes; Computer science; Data mining; Educational institutions; Itemsets; Logic; Transaction databases; Turning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
Type :
conf
DOI :
10.1109/ICIECS.2009.5366415
Filename :
5366415
Link To Document :
بازگشت