DocumentCode :
167532
Title :
Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU Cluster
Author :
Fujita, Norihisa ; Nuga, Hideo ; Boku, Taisuke ; Idomura, Yasuhiro
Author_Institution :
Grad. Sch. of Syst. & Inf. Eng., Univ. of Tsukuba, Tsukuba, Japan
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
1266
Lastpage :
1274
Abstract :
The conservative global gyrokinetic toroidal full-f five-dimensional Vlasov simulation (GT5D) is a nuclear fusion simulation program designed to analyze turbulence phenomena in tokamak plasma. In this research, we optimize it for graphics processing unit (GPU) clusters with multiple GPUs on each node. Based on profiling results for GT5D on a CPU node, we offload the entire time development part of the program to GPUs, except for MPI communication. Our evaluation shows a maximum speedup of 3.35 times at the function level and 1.91 times in total performance with a GPU, compared with CPU-only execution. Both results were measured on the high-density GPU cluster HA-PACS, where each computation node has four NVIDIA M2090 GPUs and two Intel Xeon E5-2670 (Sandy Bridge) CPUs that provide 16 cores in total. Note that these performance improvements for a single GPU were measured against four CPU cores, not a single CPU core, and include a 63% performance gain obtained by overlapping communication between MPI processes with GPU calculations.
Keywords :
Tokamak devices; application program interfaces; graphics processing units; nuclear engineering computing; nuclear fusion; plasma toroidal confinement; GPU cluster; GT5D; HA-PACS; Intel Xeon E5-2670; MPI communication; NVIDIA M2090 GPU; global gyrokinetic toroidal full-f five-dimensional Vlasov simulation; graphics processing unit; nuclear fusion simulation code optimization; performance evaluation; tokamak plasma; turbulence phenomena; Computational modeling; Data transfer; Fusion reactors; Graphics processing units; Instruction sets; Kernel; Plasmas; CUDA; GPGPU; Nuclear Fusion;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
Type :
conf
DOI :
10.1109/IPDPSW.2014.142
Filename :
6969524