Title :
K-Means on Commodity GPUs with CUDA
Author :
Hong-tao, Bai ; Li-li, He ; Dan-tong, Ouyang ; Zhan-shan, Li ; He, Li
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
fDate :
March 31 2009-April 2 2009
Abstract :
K-means algorithm is one of the most famous unsupervised clustering algorithms. Many theoretical improvements for the performance of original algorithms have been put forward, while almost all of them are based on single instruction single data (SISD) architecture processors (GPUs), which partly ignored the inherent paralleled characteristic of the algorithms. In this paper, a novel single instruction multiple data (SIMD) architecture processors (GPUs) based k-means algorithm is proposed. In this algorithm, in order to accelerate compute-intensive portions of traditional k-means, both data objects assignment and k-centroids recalculation are offloaded to the GPU in parallel. We have implemented this GPU-based k-means on the newest generation GPU with compute unified device architecture(CUDA). The numerical experiments demonstrated that the speed of GPU-based k-means could reach as high as 40 times of the CPU-based k-means.
Keywords :
computer architecture; computer graphic equipment; parallel processing; pattern clustering; architecture processor; compute unified device architecture; data objects assignment; graphics processor unit; k-centroids recalculation; k-means algorithm; single instruction multiple data; single instruction single data; unsupervised clustering algorithms; Acceleration; Clustering algorithms; Computer architecture; Computer science; Concurrent computing; Graphics; Hardware; Helium; Kernel; Yarn; CUDA; GPU; K-means; SIMD;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.491