Title :
Fast parallel association rule mining without candidacy generation
Author :
Zaïane, Osmar R. ; El-Hajj, Mohammad ; Lu, Paul
Author_Institution :
Alberta Univ., Edmonton, Alta., Canada
Abstract :
In this paper we introduce a new parallel algorithm MLFPT (multiple local frequent pattern tree) for parallel mining of frequent patterns, based on FP-growth mining, that uses only two full I/O scans of the database, eliminating the need for generating candidate items, and distributing the work fairly among processors. We have devised partitioning strategies at different stages of the mining process to achieve near optimal balancing between processors. We have successfully tested our algorithm on datasets larger than 50 million transactions
Keywords :
data mining; parallel algorithms; resource allocation; very large databases; FP-growth mining; I/O scans; MLFPT parallel algorithm; datasets; fast parallel association rule mining; frequent patterns; multiple local frequent pattern tree; optimal processor balancing; partitioning strategies; transactions; Association rules; Data mining; Itemsets; Marketing and sales; Memory architecture; Parallel algorithms; Partitioning algorithms; Recommender systems; Testing; Transaction databases;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989600