• DocumentCode
    2896283
  • Title

    Dynamic load balancing on GPU clusters for large-scale K-Means clustering

  • Author

    Kijsipongse, Ekasit ; U-ruekolan, Suriya

  • Author_Institution
    Large-Scale Simulation Res. Lab., Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
  • fYear
    2012
  • fDate
    May 30 2012-June 1 2012
  • Firstpage
    346
  • Lastpage
    350
  • Abstract
    K-Means is the clustering algorithm which is widely used in many areas such as information retrieval, computer vision and pattern recognition. With the recent advance in General Purpose Graphics Processing Unit (GPGPU), we can use a modern GPU which is capable to do computation up to Tflops to calculate K-Means clustering on average problems. However, due to the exponential growth of data, the K-Means clustering on a single GPU will not be adequate for large datasets in the near future. In this paper, we present the design and implementation of an efficient large-scale parallel K-Means on GPU clusters. We utilize the massive parallelism in GPUs to speed up the most time consuming part of K-Means clustering in each node. We employ the dynamic load balancing to distribute workload equally on different GPUs installed in the clusters so as to improve the performance of the parallel K-Means at the inter-node level. We also take advantage from software distributed shared memory to simplify the communication and collaboration among nodes. The result of the evaluation shows the performance improvement of the parallel K-Means by maintaining load balance on GPU clusters.
  • Keywords
    distributed shared memory systems; graphics processing units; parallel processing; pattern clustering; resource allocation; software performance evaluation; GPU clusters; dynamic load balancing; general purpose graphics processing unit; inter-node level performance improvement; large-scale parallel K-means clustering; nodes collaboration; software distributed shared memory; workload distribution; Arrays; Clustering algorithms; Graphics processing unit; Kernel; Load management; Message systems; Performance evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering (JCSSE), 2012 International Joint Conference on
  • Conference_Location
    Bangkok
  • Print_ISBN
    978-1-4673-1920-1
  • Type

    conf

  • DOI
    10.1109/JCSSE.2012.6261977
  • Filename
    6261977