• DocumentCode
    2203787
  • Title

    Speeding up K-Means Algorithm by GPUs

  • Author

    Li, You ; Zhao, Kaiyong ; Chu, Xiaowen ; Liu, Jiming

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong, China
  • fYear
    2010
  • fDate
    June 29 2010-July 1 2010
  • Firstpage
    115
  • Lastpage
    122
  • Abstract
    Cluster analysis plays a critical role in a wide variety of applications, but it is now facing the computational challenge due to the continuously increasing data volume. Parallel computing is one of the most promising solutions to overcoming the computational challenge. In this paper, we target at parallelizing k-Means, which is one of the most popular clustering algorithms, by using the widely available Graphics Processing Units (GPUs). Different from existing GPU-based k-Means algorithms, we observe that data dimension is an important factor that should be taken into consideration when parallelizing k-Means on GPUs. In particular, we use two different strategies for low-dimensional data sets and high-dimensional data sets respectively, in order to make the best use of the power of GPUs. For low-dimensional data sets, we exploit GPU on-chip registers to significantly decrease data access latency. For high-dimensional data sets, we design a novel algorithm which simulates matrix multiplication and exploits GPU on-chip registers and also on-chip shared memory to achieve high compute-to-memory-access ratio. As a result, our GPU-based k-Means algorithm is three to eight times faster than the best reported GPU-based algorithm.
  • Keywords
    learning (artificial intelligence); matrix algebra; multiprocessing systems; parallel processing; pattern clustering; GPU on-chip registers; cluster analysis; compute-to-memory-access ratio; graphics processing units; k-means algorithm; matrix multiplication; on-chip shared memory; parallel computing; Algorithm design and analysis; Clustering algorithms; Graphics processing unit; Instruction sets; Memory management; Registers; System-on-a-chip; CUDA; GPGPU; cluster; k-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on
  • Conference_Location
    Bradford
  • Print_ISBN
    978-1-4244-7547-6
  • Type

    conf

  • DOI
    10.1109/CIT.2010.60
  • Filename
    5578441