DocumentCode
2037862
Title
GPU-based Arnoldi factorisation for accelerating finite element eigenanalysis
Author
Lezar, E. ; Davidson, D.B.
Author_Institution
Dept. of Electr. & Electron. Eng., Univ. of Stellenbosch, Stellenbosch, South Africa
fYear
2009
fDate
14-18 Sept. 2009
Firstpage
380
Lastpage
383
Abstract
We present a GPU-accelerated implementation of the k-step Arnoldi factorisation that forms the basis of a number of iterative eigenvalue system solvers. These solvers are important for the finite element analysis of the cutoff and dispersion characteristics of waveguide structures as well as cavity resonances and since they contribute significantly to the runtime in computing a solution, their acceleration is of interest. The initial GPU-based implementation makes use of accelerated BLAS routines for the CUDA API from NVIDIA (cublas). This allows us to utilise the computational power of the GPU at a functional level as a proof of concept with minimal coding effort. The implementation is then refined to make use of enhancements to the matrix-vector multiplication routines proposed by Fujimoto further improving performance.
Keywords
computational electromagnetics; coprocessors; eigenvalues and eigenfunctions; finite element analysis; iterative methods; mathematics computing; matrix multiplication; vectors; waveguides; CUDA API; GPU-based Arnoldi factorisation; NVIDIA; accelerated BLAS routines; computational electromagnetics; cublas; finite element eigenanalysis acceleration; iterative eigenvalue system solvers; matrix-vector multiplication routines; waveguide structures; Acceleration; Computational electromagnetics; Eigenvalues and eigenfunctions; Electromagnetic heating; Electromagnetic waveguides; Finite element methods; Matrix decomposition; Resonance; Runtime; Transmission line matrix methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Electromagnetics in Advanced Applications, 2009. ICEAA '09. International Conference on
Conference_Location
Torino
Print_ISBN
978-1-4244-3385-8
Electronic_ISBN
978-1-4244-3386-5
Type
conf
DOI
10.1109/ICEAA.2009.5297413
Filename
5297413
Link To Document