Title :
A Comparative Study of Preconditioners for GPU-Accelerated Conjugate Gradient Solver
Author :
Yao Chen ; Yonghua Zhao ; Wei Zhao ; Lian Zhao
Author_Institution :
Comput. Network Inf. Center, Univ. of Chinese Acad. of Sci., Beijing, China
Abstract :
We compare two types of preconditioners for GPU-Accelerated conjugate gradient solver. For the standard IC preconditioner, we exploit level scheduling to increase multi-thread parallelism of sparse triangular solve on GPU. Meanwhile, we propose a novel reordering technique to maximize the coalescing of global memory accesses. For the approximate inverse preconditioner SSOR-AI, we extend it to second order approximation. Experiments indicate that our IC PCG runs 25% faster than using vendor implementation in CUSPARSE library and SSOR-AI PCG can be twice as fast as IC PCG.
Keywords :
graphics processing units; multi-threading; scheduling; CUSPARSE library; GPU-accelerated conjugate gradient solver; IC PCG; SSOR-AI PCG; global memory accesses; inverse preconditioner SSOR-AI; level scheduling; multithread parallelism; reordering technique; second order approximation; standard IC preconditioner; Approximation methods; Graphics processing units; Integrated circuits; Linear systems; Optimization; Parallel processing; Sparse matrices; Approximate Inverse; Conjugate Gradient; GPU; Incomplete Cholesky; Preconditioner;
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
DOI :
10.1109/HPCC.and.EUC.2013.94