DocumentCode :
2053442
Title :
Hash based parallel algorithms for mining association rules
Author :
Shintani, Takahiko ; Kitsuregawa, Masaru
Author_Institution :
Inst. of Ind. Sci., Tokyo Univ., Japan
fYear :
1996
fDate :
18-20 Dec 1996
Firstpage :
19
Lastpage :
30
Abstract :
We propose four parallel algorithms (NPA, SPA, HPA and HPA-ELD) for mining association rules on shared nothing parallel machines to improve its performance. In NPA, candidate itemsets are just copied amongst all the processors, which can lead to memory overflow for large transaction databases. The remaining three algorithms partition the candidate itemsets over the processors. If it is partitioned simply (SPA), transaction data has to be broadcast to all processors. HPA partitions the candidate itemsets using a hash function to eliminate broadcasting, which also reduces the comparison workload significantly. HPA-ELD fully utilizes the available memory space by detecting the extremely large itemsets and copying them, which is also very effective at flattering the load over the processors. We implemented these algorithms in a shared nothing environment. Performance evaluations show that the best algorithm, HPA-ELD, attains good linearity on speedup ratio and is effective for handling skew
Keywords :
deductive databases; file organisation; knowledge acquisition; parallel algorithms; parallel machines; HPA; HPA-ELD; NPA; SPA; association rule mining; broadcasting; candidate itemsets; hash based parallel algorithms; hash function; large transaction databases; memory overflow; memory space; performance evaluations; shared nothing environment; shared nothing parallel machines; Association rules; Broadcasting; Data analysis; Data mining; Itemsets; Mining industry; Parallel algorithms; Parallel machines; Partitioning algorithms; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Information Systems, 1996., Fourth International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
0-8186-7475X
Type :
conf
DOI :
10.1109/PDIS.1996.568664
Filename :
568664
Link To Document :
بازگشت