DocumentCode
69474
Title
GPU Cluster Implementation of FMM-FFT for Large-Scale Electromagnetic Problems
Author
Vinh Dang ; Nguyen, Quang M. ; Kilic, Ozlem
Author_Institution
EECS, Catholic Univ. of America, Washington, DC, USA
Volume
13
fYear
2014
fDate
2014
Firstpage
1259
Lastpage
1262
Abstract
The fast multipole method (FMM) combined with fast Fourier transform (FFT) is investigated for the solution of large-scale electromagnetic problems, which require high computational capability that cannot be accommodated using conventional computing systems. The implementation is parallelized on a 13-node graphics processing unit (GPU) cluster that populates Nvidia Tesla M2090 GPUs. The experimental results based on our FMM-FFT implementation on GPUs demonstrate up to 957 times speedup compared to that of the single-core, single-node CPU implementation. The implementation details and the performance achievements in terms of accuracy, speedup, and scalability are discussed.
Keywords
computational electromagnetics; electromagnetic wave scattering; fast Fourier transforms; graphics processing units; FMM-FFT; GPU cluster; Nvidia Tesla M2090 GPUs; fast Fourier transform; fast multipole method; graphics processing unit; large-scale electromagnetic scattering problems; single-core single-node CPU; Antennas; Electromagnetics; Graphics processing units; Method of moments; Rough surfaces; Scalability; Surface roughness; Fast Fourier transform (FFT); fast nultipole method (FMM); graphics processing unit (GPU) clusters; iterative solvers; method of moments (MoM);
fLanguage
English
Journal_Title
Antennas and Wireless Propagation Letters, IEEE
Publisher
ieee
ISSN
1536-1225
Type
jour
DOI
10.1109/LAWP.2014.2332972
Filename
6843872
Link To Document