DocumentCode :
3028136
Title :
Detection and GPU accelerationof 3D FDTD algorithms based on memory access patterns
Author :
Ran Shao ; Linton, D. ; Spence, Ivor ; Milligan, P. ; Ning Zheng
Author_Institution :
Sch. of Electron., Electr. Eng. & Comput. Sci., Queen´s Univ. of Belfast, Belfast, UK
fYear :
2013
fDate :
20-22 Dec. 2013
Firstpage :
2520
Lastpage :
2526
Abstract :
A semi-automatic tool is reported that first analyzes the sequential FDTD program to obtain memory access patterns and related features, and then optimizes the FDTD program with combined use of several types of CUDA memory on both Fermi and Kepler architecture GPUs. The experiments show a 13% and 18% speedup using Fermi and Kepler GPUs respectively compared to the GPU version program without optimization. Up to 142 times speedup is achieved compared to the sequential FDTD C program at a FDTD 3D mesh size of 250* 250* 250 (15.625 million mesh cells) with 10 layers CPML boundary conditions in 4096 time steps.
Keywords :
finite difference time-domain analysis; graphics processing units; mathematics computing; mesh generation; parallel architectures; 3D FDTD algorithms; CPML boundary conditions; CUDA memory; FDTD 3D mesh size; Fermi architecture GPU; GPU acceleration; Kepler architecture GPU; convolutional perfect matching layer boundary conditions; finite-difference time-domain method; memory access patterns; semiautomatic tool; sequential FDTD C program; sequential FDTD program; Acceleration; Finite difference methods; Graphics processing units; Memory management; System-on-chip; Three-dimensional displays; Time-domain analysis; CUDA 5.0; FDTD; LLVM; Memory access pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on
Conference_Location :
Shengyang
Print_ISBN :
978-1-4799-2564-3
Type :
conf
DOI :
10.1109/MEC.2013.6885460
Filename :
6885460
Link To Document :
بازگشت