DocumentCode
40092
Title
Performance Analysis of Multi-GPU Implementations of Krylov-Subspace Methods Applied to FEA of Electromagnetic Phenomena
Author
Peixoto de Camargos, Ana Flavia ; Silva, Viviane Cristine
Author_Institution
Escola Politec. da Univ. de Sao Paulo, Sao Paulo, Brazil
Volume
51
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
1
Lastpage
4
Abstract
We present a comparison of performances of various graphic processing unit (GPU)-CUDA implementations of preconditioned Krylov-subspace methods, showing the impact of using a multi-GPU configuration. We aim to show that this resource allows the massively parallelized solution of large-scale real-world problems in state-of-the-art desktop PCs, since it overcomes the low-memory capacity of a single GPU, while still providing significant speedups when compared with either the usual sequential execution on a single-core CPU or an OpenMP implementation with four cores. The methods were selected based on their suitability to solve large-scale systems of equations arising from the 3-D finite-element analysis of open-bound earth current diffusion problems, both in steady state and under time-harmonic loading. In the latter, an ungauged harmonic edge-element formulation using the magnetic vector potential and the electric scalar potential was used. As the preconditioners suitable to this case, based on incomplete factorizations, are not appropriate for parallelization, we propose a hybrid CPU-GPU scheme to solve such problems, which still exhibits a competitive performance in low-cost PC desktops.
Keywords
finite element analysis; graphics processing units; parallel architectures; 3D finite-element analysis; FEA; GPU-CUDA implementations; OpenMP implementation; electric scalar potential; electromagnetic phenomena; graphic processing unit; hybrid CPU-GPU scheme; low-cost PC desktops; low-memory capacity; magnetic vector potential; multi-GPU implementations; open-bound earth current diffusion problem; parallelized solution; preconditioned Krylov-subspace method; sequential execution; single-core CPU; steady state; time-harmonic loading; ungauged harmonic edge-element formulation; Graphics processing units; Harmonic analysis; Iterative methods; Linear systems; Random access memory; Runtime; Silicon carbide; Edge elements; Krylov-subspace solver; finite-element (FE) method; graphic processing unit (GPU);
fLanguage
English
Journal_Title
Magnetics, IEEE Transactions on
Publisher
ieee
ISSN
0018-9464
Type
jour
DOI
10.1109/TMAG.2014.2363047
Filename
7093405
Link To Document