Title :
Accuracy, Memory, and Speed Strategies in GPU-Based Finite-Element Matrix-Generation
Author :
Dziekonski, Adam ; Sypek, Piotr ; Lamecki, Adam ; Mrozowski, Michal
Author_Institution :
Dept. of Microwave & Antenna Eng., Gdansk Univ. of Technol., Gdansk, Poland
fDate :
7/4/1905 12:00:00 AM
Abstract :
This letter presents strategies on how to optimize graphics processing unit (GPU)-based finite-element matrix-generation that occurs in the finite element method (FEM) using higher-order curvilinear elements. The goal of the optimization is to increase the speed of evaluation and assembly of large finite-element matrices on a single GPU while maintaining the accuracy of numerical integration at the desired level. For this reason, the choice of the optimal Gaussian quadratures for curvilinear finite elements focused on accuracy, memory usage, and runtime of numerical integration is discussed. Moreover, we show how to efficiently utilize symmetry of local mass and stiffness matrices on a GPU in the numerical integration step. The performance results, obtained on a workstation equipped with one Tesla C2075, indicate that the proposed strategies retain the accuracy of computations, allow generation of larger sparse linear systems, and provide 2.5-fold acceleration of GPU-based finite-element matrix-generation.
Keywords :
finite element analysis; graphics processing units; optimisation; sparse matrices; GPU-based finite-element matrix-generation; Tesla C2075; accuracy strategy; curvilinear finite element method; graphics processing unit; higher-order curvilinear element; memory strategy; numerical integration; numerical integration step; optimal Gaussian quadrature; optimization; sparse linear system; speed strategy; Accuracy; Assembly; Finite element methods; Graphics processing units; Memory management; Symmetric matrices; Transmission line matrix methods; Finite-element matrix-generation; Gaussian quadrature; graphics processing unit (GPU);
Journal_Title :
Antennas and Wireless Propagation Letters, IEEE
DOI :
10.1109/LAWP.2012.2227449