Title :
Performance Analysis of Parallel Apriori on Heterogeneous Nodes
Author :
Shah, Ketan D. ; Mahajan, Mrs Sunita
Author_Institution :
Inf. Technol. Dept., SVKM´´s NMIMS Univ., Mumbai, India
Abstract :
The paper analyzes the performance of parallel apriori algorithm on heterogeneous nodes with different datasets and over n processors on a commodity cluster of machines. In the apriori algorithm all processes need to synchronize after every pass. If any process is assigned more load than other processes in the system, the slowest process will dictate the speed of the program. It is therefore important to ensure that load is equally balanced among all processes. Memory, speed of the processor and cache play a significant role in the processing capacity of the system. The experiments show that nodes with different configurations affect the performance of the parallel apriori algorithm. In order to maximize the efficiency it is required to balance the data set based on the processing speed of the various nodes present in the cluster.
Keywords :
data mining; datasets; heterogeneous nodes; parallel apriori algorithm; Algorithm design and analysis; Association rules; Clustering algorithms; Concurrent computing; Data mining; Itemsets; Load management; Parallel algorithms; Performance analysis; Spatial databases; Apriori Algorithm; Association Rules; Commodity Cluster; Data Mining; Load Balancing; Parallel Mining;
Conference_Titel :
Advances in Computing, Control, & Telecommunication Technologies, 2009. ACT '09. International Conference on
Conference_Location :
Trivandrum, Kerala
Print_ISBN :
978-1-4244-5321-4
Electronic_ISBN :
978-0-7695-3915-7
DOI :
10.1109/ACT.2009.20