DocumentCode :
3728340
Title :
Parallel Black Hole Clustering Based on MapReduce
Author :
Chun-Wei Tsai;Cheng-Han Hsieh;Ming-Chao Chiang
Author_Institution :
Dept. of Comput. Sci. &
fYear :
2015
Firstpage :
2543
Lastpage :
2548
Abstract :
One of the key reasons that traditional clustering methods are inefficient for analyzing large-scale datasets is because most of them are designed for a centralized system. This means that if the size of input data exceeds the size of storage or memory of such a system, it would make the task of clustering much more difficult. To mitigate this problem, an efficient clustering algorithm, called MapReduce Black Hole (MRBH), is presented in this paper to leverages the strength of the black hole algorithm and the MapReduce programming model of Hadoop to accelerate the clustering speed by both software and hardware. By using MapReduce, MRBH will then divide a large dataset into a number of small data sets and cluster these smaller data sets in parallel. Moreover, MRBH inherits the characteristics of the black hole algorithm, meaning that no parameters are to be set manually, thus, the implementation is easy. To evaluate the performance of the proposed algorithm, several datasets are used with different numbers of nodes. Experimental results show that the proposed algorithm can provide a significant speedup as the number of nodes increases.
Keywords :
"Clustering algorithms","Algorithm design and analysis","Software algorithms","Programming","Computational modeling","Gravity"
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/SMC.2015.445
Filename :
7379577
Link To Document :
بازگشت