Title :
A parallel algorithm for frequent itemset mining
Author :
Li, Li ; Zhai, Donghai ; Jin, Fan
Author_Institution :
Sch. of Comput. & Commun. Eng., Southwest Jiaotong Univ., Chengdu, China
Abstract :
Frequent itemsets mining plays an essential role in data mining. A new algorithm PFP-growth (parallel FP-growth), which is based on the improved FP-growth, is proposed for parallel frequent itemset mining. The new algorithm distributes the task fairly among the parallel processors. We devise partitioning strategies at different stages of the mining process to achieve balance between processors and adopt some data structure to reduce the information transportation between processors. The experiments on national high performance parallel computer show that the PFP-growth is an efficient parallel algorithm for mining frequent itemset.
Keywords :
data mining; data structures; parallel algorithms; PFP-growth algorithm; data mining; data structure; frequent itemset mining; information transportation; parallel FP-growth algorithm; parallel algorithm; parallel computer; parallel processors; Broadcasting; Data mining; Itemsets; Merging; Parallel algorithms; Transportation;
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies, 2003. PDCAT'2003. Proceedings of the Fourth International Conference on
Print_ISBN :
0-7803-7840-7
DOI :
10.1109/PDCAT.2003.1236435