Title :
Implementing Sparse Matrix-Vector multiplication using CUDA based on a hybrid sparse matrix format
Author :
Cao, Wei ; Yao, Lu ; Li, Zongzhe ; Wang, Yongxian ; Wang, Zhenghua
Author_Institution :
Nat. Key Lab. for Parallel & Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
The Sparse Matrix-Vector product (SpMV) is a key operation in engineering and scientific computing. Methods for efficiently implementing it in parallel are critical to the performance of many applications. Modern Graphics Processing Units (GPUs) coupled with the advent of general purpose programming environments like NVIDIA´s CUDA, have gained interest as a viable architecture for data-parallel general purpose computations. Currently, SpMV implementations using CUDA based on common sparse matrix format have already appeared. Among them, the performance of implementation based on ELLPACK-R format is the best. However, in this implementation, when the maximum number of nonzeros per row does substantially differ from the average, thread is suffering from load imbalance. This paper proposes a new matrix storage format called ELLPACK-RP, which combines ELLPACK-R format with JAD format, and implements the SpMV using CUDA based on it. The result proves that it can decrease the load imbalance and improve the SpMV performance efficiently.
Keywords :
computer graphic equipment; coprocessors; mathematics computing; matrix multiplication; parallel architectures; performance evaluation; sparse matrices; vectors; CUDA; ELLPACK-R format; GPU; JAD format; compute unified device architecture; data parallel general purpose computation; general purpose programming environment; load imbalance; modern graphics processing unit; sparse matrix vector multiplication; Arrays; Artificial neural networks; Graphics; Graphics processing unit; Instruction sets; Kernel; Sparse matrices; CUDA; ELLPACKRP; GPU; SpMV; matrix format;
Conference_Titel :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
DOI :
10.1109/ICCASM.2010.5623237