DocumentCode
125677
Title
GPU Implementation of Inverse Iteration Algorithm for Computing Eigenvectors
Author
Ishigami, Hiroyuki ; Kimura, K. ; Nakamura, Yoshihiko
Author_Institution
Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
fYear
2014
fDate
12-14 Feb. 2014
Firstpage
664
Lastpage
671
Abstract
Effective GPU implementations of an inverse iteration algorithm with reorthogonalization are proposed for computing eigenvectors of symmetric tridiagonal matrices. The key to effectively accelerating the inverse iteration algorithm in GPU computing is the adoption of reorthogonalization code optimal for the GPU. The CGS2 algorithm and the compact WY orthogonalization algorithm, which can be implemented using level 2 BLAS routines, are implemented using CUBLAS. The size of the data transferred between the CPU and GPU is also optimally reduced. The proposed code of the inverse iteration algorithm using the CGS2 algorithm is shown to map well to a GPU and to achieve high performance through numerical experiments on a CPU-GPU heterogeneous computer.
Keywords
graphics processing units; iterative methods; parallel processing; CGS2 algorithm; CPU-GPU heterogeneous computer; CUBLAS; GPU implementations; compact WY orthogonalization algorithm; inverse iteration algorithm; level 2 BLAS routines; reorthogonalization code; symmetric tridiagonal matrices; Acceleration; Clustering algorithms; Computational efficiency; Eigenvalues and eigenfunctions; Graphics processing units; Symmetric matrices; Vectors; GPU computing; eigenvector; inverse iteration; reorthogonalization; symmetric eingenvalue problem;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel, Distributed and Network-Based Processing (PDP), 2014 22nd Euromicro International Conference on
Conference_Location
Torino
ISSN
1066-6192
Type
conf
DOI
10.1109/PDP.2014.39
Filename
6787345
Link To Document