• DocumentCode
    3767414
  • Title

    Reducing Communication and Merging Overheads for Distributed Clustering Algorithms on the Cloud

  • Author

    Chun-Chieh Chen;Tze-Yu Chen;Jen-Wei Huang;Ming-Syan Chen

  • Author_Institution
    Grad. Inst. of Networking &
  • fYear
    2015
  • Firstpage
    41
  • Lastpage
    48
  • Abstract
    Many distributed clustering algorithms have been proposed to speed up data clustering on huge database. However, the existing distributed clustering algorithms still suffer from many issues on distributed system such as data synchronization, insufficient scalability, and maintenance difficulties. In this paper, we propose two distributed clustering algorithms named DDC and DGC, which are based on the cloud computing technique. The main ideas of proposed algorithms are to achieve load balance according to an efficient data partition, to cluster more data on many machines in parallel without data dependency, and to merge the result on a machine efficiently with minimal information overlap. The experimental results show that DDC and DGC are able to reduce the execution time and achieve great scalability on the cloud.
  • Keywords
    "Clustering algorithms","Partitioning algorithms","Algorithm design and analysis","Cloud computing","Distributed databases","Scalability","Merging"
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing and Big Data (CCBD), 2015 International Conference on
  • Type

    conf

  • DOI
    10.1109/CCBD.2015.9
  • Filename
    7450529