Title :
An Optimized GP-GPU Warp Scheduling Algorithm for Sparse Matrix-Vector Multiplication
Author :
Lifeng Liu ; Meilin Liu ; Chong-Jun Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Wright State Univ., Dayton, OH, USA
Abstract :
GP-GPUs have been used as the platform for many applications due to their powerful computation ability and massively parallel features. In this paper, we first investigate the CSR sparse matrix format, the performance of existing optimized SpMV (Sparse matrix-vector multiplication) algorithms, and analyze the memory access patterns of the SpMV algorithms. Based on the analysis of the memory access patterns, we propose a new thread scheduling technique that can take advantage of inter-warp locality and intra-warp locality simultaneously, and also can achieve memory coalescing automatically. This proposed new scheduling technique will change the memory access pattern of SpMVs significantly. The simulation results show that the performance of the SpMV using the new proposed thread scheduling technique achieves much better performance than the implementation of the SpMV optimized by other techniques.
Keywords :
graphics processing units; mathematics computing; matrix multiplication; multiprocessing systems; scheduling; vectors; CSR sparse matrix format; general purpose graphics processing unit; inter-warp locality; intra-warp locality; memory access patterns; memory coalescence; optimized GP-GPU warp scheduling algorithm; optimized SpMV algorithms; sparse matrix-vector multiplication; thread scheduling technique; Algorithm design and analysis; Arrays; Graphics processing units; Instruction sets; Sparse matrices; Vectors; GPU; SpMV; data locality; manycore; multicore; scheduling;
Conference_Titel :
Networking, Architecture and Storage (NAS), 2013 IEEE Eighth International Conference on
Conference_Location :
Xi´an
DOI :
10.1109/NAS.2013.35