مرکز منطقه ای اطلاع رساني علوم و فناوري - Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

DocumentCode :

1828535

Title :

Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

Author :

Cheik Ahamed, Abal-Kassim ; Magoules, Frederic

Author_Institution :

Appl. Math. & Syst. Lab., Ecole Centrale Paris Chatenay-Malabry, Paris, France

fYear :

2012

fDate :

25-27 June 2012

Firstpage :

836

Lastpage :

842

Abstract :

Many engineering and science problems require a computational effort to solve large sparse linear systems. Krylov subspace based iterative solvers have been widely used in that direction. Iterative Krylov methods involve linear algebra operations such as summation of vectors, dot product, norm, and matrix-vector multiplication. Since these operations could be very costly in computation time on Central Processing Unit (CPU), we propose in this paper to focus on the design of iterative solvers to take advantage of massive parallelism of Graphics Processing Unit (GPU). We consider Stabilized BiConjugate Gradient (BiCGStab), Stabilized BiConjugate Gradient (L) (BiCGStabl), Generalized Conjugate Residual (P-GCR), Bi-Conjugate Gradient Conjugate Residual (P-BiCGCR), transpose-free Quasi Minimal Residual (P-tfQMR) for the solution of sparse linear systems with non symmetric matrices and Conjugate Gradient (CG) for symmetric positive definite matrices. We discuss data format and data structure for sparse matrices, and how to efficiently implement these solvers on the Nvidia´s CUDA platform. The scalability and performance of the methods are tested on several engineering problems, together with numerous numerical experiments which clearly illustrate the robustness, competitiveness and efficiency of our own proper implementation compared to the existing libraries.

Keywords :

conjugate gradient methods; data structures; graphics processing units; iterative methods; matrix multiplication; parallel architectures; sparse matrices; vectors; BiCGStabl; CPU; GPU; Krylov subspace-based iterative solvers; Nvidia CUDA platform; P-BiCGCR; P-GCR; P-tfQMR; biconjugate gradient conjugate residual; central processing unit; computation time; data format; data structure; dot product; generalized conjugate residual; graphics processing unit; linear algebra operations; matrix-vector multiplication; nonsymmetric matrices; norms; sparse linear systems; stabilized biconjugate gradient l; symmetric positive definite matrices; transpose-free quasiminimal residual; vector summation; Arrays; Graphics processing unit; Instruction sets; Libraries; Sparse matrices; Vectors; CUBLAS; CUDA; CUSPARSE; Cusp; Krylov methods; graphics processing unit; linear algebra; sparse matrix-vector multiplication;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on

Conference_Location :

Liverpool

Print_ISBN :

978-1-4673-2164-8

Type :

conf

DOI :

10.1109/HPCC.2012.118

Filename :

6332256

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1828535