DocumentCode :
2979655
Title :
Accelerating Volkov´s Hybrid Implementation of Cholesky Factorization on a Fermi GPU
Author :
Shih-Chieh Wei ; Bormin Huang
Author_Institution :
Dept. of Inf. Manage., Tamkang Univ., Tamsui, Taiwan
fYear :
2012
fDate :
17-19 Dec. 2012
Firstpage :
896
Lastpage :
900
Abstract :
In linear algebra, Cholesky factorization is useful in solving a system of equations with a symmetric positive definite coefficient matrix. Cholesky factorization is roughly twice as fast relative to LU factorization which applies to general matrices. In recent years, with advances in technology, a Fermi GPU card can accommodate hundreds of cores compared to the small number of 8 or 16 cores on CPU. Therefore a trend is seen to use the graphics card as a general purpose graphics processing unit (GPGPU) for parallel computation. In this work, Volkov´s hybrid implementation of Cholesky factorization is evaluated on the new Fermi GPU with others and then some improvement strategies were proposed. After experiments, compared to the CPU version using Intel Math Kernel Library (MKL), our proposed GPU improvement strategy can achieve a speedup of 3.85x on Cholesky factorization of a square matrix of dimension 10,000.
Keywords :
graphics processing units; matrix algebra; parallel processing; Cholesky factorization; Fermi GPU; GPGPU; Intel Math Kernel Library; MKL; accelerating Volkov hybrid implementation; general matrices; general purpose graphics processing unit; parallel computation; square matrix; symmetric positive definite coefficient matrix; Educational institutions; Graphics processing units; Instruction sets; Kernel; Libraries; Linear algebra; Symmetric matrices; Cholesky factorization; general purpose graphics processing unit; parallel computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th International Conference on
Conference_Location :
Singapore
ISSN :
1521-9097
Print_ISBN :
978-1-4673-4565-1
Electronic_ISBN :
1521-9097
Type :
conf
DOI :
10.1109/ICPADS.2012.147
Filename :
6413585
Link To Document :
بازگشت