Title :
An Algorithm of Mining Long Frequent Itemsets Based on Digit Sequence
Author :
Fang, Gang ; Xiong, Jiang ; Tu, Cheng-Sheng ; Wu, Yuan-bin
Author_Institution :
Coll. of Math & Comput. Sci., Chongqing Three Gorges Univ., Chongqing, China
Abstract :
In order to fast generate candidate frequent itemsets, avoid redundant calculation and reduce the time of scanning database, this paper proposes an algorithm of mining frequent itemsets based on digit sequence, which is suitable for mining long frequent itemsets. The algorithm firstly turns all transactions into digital transactions by binary, and then computing digit sequence of every attribute item. Finally, the algorithm uses the method of forming digital pure subset of digital transaction to generate candidate frequent itemsets by declining the value of digital pure subset, and uses the method of computing dimension of digit sequence of attribute items to compute support of candidate frequent itemsets, this method is used to reduce the time of scanning transactions database. The algorithm only scans once database when mining all these association rules, which is different from presented algorithms of mining long frequent itemsets. The experiment indicates that the efficiency of the algorithm is faster and more efficient than presented algorithms of association rules mining.
Keywords :
data mining; deductive databases; association rules mining; digit sequence; digital pure subset method; frequent itemsets mining; scanning transactions database; Association rules; Binary codes; Computer science; Data mining; Educational institutions; Itemsets; Logic; Transaction databases; Turning;
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5366415