Title :
An FP-split method for fast association rules mining
Author :
Lee, Chin-Feng ; Shen, Tsung-Hsien
Author_Institution :
Dept. of Inf. Manage., Chaoyang Univ. of Technol., Taichung, Taiwan
Abstract :
Recently, most of the studies on association rules mining focused on improving the efficiency of frequent itemsets generation. To our best knowledge, the FP-growth algorithm, which is based on the FP-tree to generate frequent itemsets is time-efficient. Currently, relevant studies are introduced to improve the FP-growth algorithm. However, they ignore the fact that the FP-tree construction may spend much time. Therefore, the goal of our research is to propose a fast algorithm called frequent pattern split, simply FP-split, for improving the process of the FP-tree construction. The proposed FP-split algorithm contains two main steps. The first step is to scan a transaction database only once for generating equivalence classes of frequent items. The second step is to sort these equivalence classes of frequent items in descending order so as to construct the FP-split tree. Through detailed experimental evaluations under various system conditions, our method shows excellent performance in terms of execution efficiency and scalability.
Keywords :
data mining; equivalence classes; sorting; transaction processing; tree data structures; trees (mathematics); FP-split method; FP-tree; association rule mining; equivalence classes; frequent itemsets generation; frequent pattern; sorting; transaction database scanning; Association rules; Chaos; Data mining; Electronic mail; Information analysis; Information management; Information technology; Itemsets; Scalability; Transaction databases;
Conference_Titel :
Information Technology: Research and Education, 2005. ITRE 2005. 3rd International Conference on
Print_ISBN :
0-7803-8932-8
DOI :
10.1109/ITRE.2005.1503165