DocumentCode
644403
Title
An Optimized GP-GPU Warp Scheduling Algorithm for Sparse Matrix-Vector Multiplication
Author
Lifeng Liu ; Meilin Liu ; Chong-Jun Wang
Author_Institution
Dept. of Comput. Sci. & Eng., Wright State Univ., Dayton, OH, USA
fYear
2013
fDate
17-19 July 2013
Firstpage
222
Lastpage
231
Abstract
GP-GPUs have been used as the platform for many applications due to their powerful computation ability and massively parallel features. In this paper, we first investigate the CSR sparse matrix format, the performance of existing optimized SpMV (Sparse matrix-vector multiplication) algorithms, and analyze the memory access patterns of the SpMV algorithms. Based on the analysis of the memory access patterns, we propose a new thread scheduling technique that can take advantage of inter-warp locality and intra-warp locality simultaneously, and also can achieve memory coalescing automatically. This proposed new scheduling technique will change the memory access pattern of SpMVs significantly. The simulation results show that the performance of the SpMV using the new proposed thread scheduling technique achieves much better performance than the implementation of the SpMV optimized by other techniques.
Keywords
graphics processing units; mathematics computing; matrix multiplication; multiprocessing systems; scheduling; vectors; CSR sparse matrix format; general purpose graphics processing unit; inter-warp locality; intra-warp locality; memory access patterns; memory coalescence; optimized GP-GPU warp scheduling algorithm; optimized SpMV algorithms; sparse matrix-vector multiplication; thread scheduling technique; Algorithm design and analysis; Arrays; Graphics processing units; Instruction sets; Sparse matrices; Vectors; GPU; SpMV; data locality; manycore; multicore; scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture and Storage (NAS), 2013 IEEE Eighth International Conference on
Conference_Location
Xi´an
Type
conf
DOI
10.1109/NAS.2013.35
Filename
6665367
Link To Document