DocumentCode
3649277
Title
Efficient Statistical Computing on Multicore and MultiGPU Systems
Author
Yulong Ou;Bo Li;Hailong Yang;Zhongzhi Luan;Depei Qian
Author_Institution
Dept. of Comput. Sci. &
fYear
2012
Firstpage
709
Lastpage
714
Abstract
As a statistical programming language for data analysis with a powerful graphics toolkit, R has been widely used in mathematical computing, biology simulation and medicine research. For large-scale computing such as drug discovery and protein folding, R is not good enough since it usually runs on a desktop computer. The situation gets worse when R runs on a single machine, while other computing is done on a cluster or even a supercomputer. In this paper, a parallel computing schema was proposed that R running on both CPU and GPU clusters, which have shown high multi-threaded performance while enabling high parallelism with lower energy consuming. The three statistical algorithms: chi-squared distribution, Pearson correlation coefficient and unary linear regression model were rewritten. Evaluation shows that our implementation exhibits superior performance and energy-efficiency than the single-threaded competitors. For instance, when the size of input dataset reaches 400M, the MPI implementation of the chi-squared distribution on a cluster with four nodes achieves a speedup of nearly 20x, while the CUDA implementation achieves a speedup of 5.2x on a single-GPU, and more than 15x on a system with three GPUs.
Keywords
"Graphics processing units","Algorithm design and analysis","Multicore processing","Clustering algorithms","Parallel processing","Instruction sets"
Publisher
ieee
Conference_Titel
Network-Based Information Systems (NBiS), 2012 15th International Conference on
Print_ISBN
978-1-4673-2331-4
Type
conf
DOI
10.1109/NBiS.2012.89
Filename
6354911
Link To Document