Title :
K-means implementation on FPGA for high-dimensional data using triangle inequality
Author :
Lin, Zhongduo ; Lo, Charles ; Chow, Paul
Author_Institution :
Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
Abstract :
One of the challenges to data mining raised by technology development is that both data size and dimensionality is growing rapidly. K-means, one of the most popular clustering algorithms in data mining, suffers in computational time when used for large data sets and data with high dimensionality. In this paper, we propose a hardware architecture for K-means with triangle inequality optimization on FPGA. An optimal 8-bit square calculator for 6-LUT architectures is described to minimize the hardware cost and an approximation solution is proposed to avoid square root calculation in the original triangle inequality optimization. Our software and hardware experiments are tested with the MNIST benchmark and uniform random numbers of various size. This approximation results in 2% more distance calculations for MNIST and 5% for uniform random numbers than the original optimization. Compared to the baseline hardware system without optimization, our approach achieves up to 77% improvement in processing time with about 10% logic overhead. We demonstrate that the hardware can achieve 55-fold speed up compared to software for the 1024 MNIST.
Keywords :
approximation theory; data mining; field programmable gate arrays; optimisation; pattern clustering; unsupervised learning; 6-LUT architectures; FPGA; MNIST benchmark; approximation solution; data dimensionality; data mining; data sets; data size; hardware architecture; hardware cost minimization; high-dimensional data; k-means implementation; optimal 8-bit square calculator; technology development; triangle inequality optimization; uniform random numbers; unsupervised clustering algorithms; Adders; Approximation methods; Calculators; Clustering algorithms; Field programmable gate arrays; Hardware; Optimization;
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2012 22nd International Conference on
Conference_Location :
Oslo
Print_ISBN :
978-1-4673-2257-7
Electronic_ISBN :
978-1-4673-2255-3
DOI :
10.1109/FPL.2012.6339141