DocumentCode
3767414
Title
Reducing Communication and Merging Overheads for Distributed Clustering Algorithms on the Cloud
Author
Chun-Chieh Chen;Tze-Yu Chen;Jen-Wei Huang;Ming-Syan Chen
Author_Institution
Grad. Inst. of Networking &
fYear
2015
Firstpage
41
Lastpage
48
Abstract
Many distributed clustering algorithms have been proposed to speed up data clustering on huge database. However, the existing distributed clustering algorithms still suffer from many issues on distributed system such as data synchronization, insufficient scalability, and maintenance difficulties. In this paper, we propose two distributed clustering algorithms named DDC and DGC, which are based on the cloud computing technique. The main ideas of proposed algorithms are to achieve load balance according to an efficient data partition, to cluster more data on many machines in parallel without data dependency, and to merge the result on a machine efficiently with minimal information overlap. The experimental results show that DDC and DGC are able to reduce the execution time and achieve great scalability on the cloud.
Keywords
"Clustering algorithms","Partitioning algorithms","Algorithm design and analysis","Cloud computing","Distributed databases","Scalability","Merging"
Publisher
ieee
Conference_Titel
Cloud Computing and Big Data (CCBD), 2015 International Conference on
Type
conf
DOI
10.1109/CCBD.2015.9
Filename
7450529
Link To Document