DocumentCode :
2336102
Title :
Fast parallel association rule mining without candidacy generation
Author :
Zaïane, Osmar R. ; El-Hajj, Mohammad ; Lu, Paul
Author_Institution :
Alberta Univ., Edmonton, Alta., Canada
fYear :
2001
fDate :
2001
Firstpage :
665
Lastpage :
668
Abstract :
In this paper we introduce a new parallel algorithm MLFPT (multiple local frequent pattern tree) for parallel mining of frequent patterns, based on FP-growth mining, that uses only two full I/O scans of the database, eliminating the need for generating candidate items, and distributing the work fairly among processors. We have devised partitioning strategies at different stages of the mining process to achieve near optimal balancing between processors. We have successfully tested our algorithm on datasets larger than 50 million transactions
Keywords :
data mining; parallel algorithms; resource allocation; very large databases; FP-growth mining; I/O scans; MLFPT parallel algorithm; datasets; fast parallel association rule mining; frequent patterns; multiple local frequent pattern tree; optimal processor balancing; partitioning strategies; transactions; Association rules; Data mining; Itemsets; Marketing and sales; Memory architecture; Parallel algorithms; Partitioning algorithms; Recommender systems; Testing; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
Type :
conf
DOI :
10.1109/ICDM.2001.989600
Filename :
989600
Link To Document :
بازگشت