DocumentCode
1973372
Title
Performance analysis and optimization of Gyrokinetic Torodial Code on TH-1A supercomputer
Author
Zhu, Xiaoqian ; Liu, Xin ; Meng, Xiangfei ; Feng, Jinghua
Author_Institution
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear
2011
fDate
16-18 Sept. 2011
Firstpage
6027
Lastpage
6031
Abstract
In this study, we test and analyze the performance of the Gyrokinetic Toroidal Code (GTC) program. Based on the analysis results, we port GTC's compute-intensive subroutines to the GPU and accelerate them on the "CPU+GPU" heterogeneous architecture of the TH-1A supercomputer. Several optimization strategies are developed in this process: subroutines are merged to reduce data transfer between host and device, GPU memory access is optimized to reduce access latency, and the static keyword is placed before array declarations to avoid unnecessary address allocation and data copying. Experimental results show that the subroutines ported to the GPU run 6 to 8 times faster, and that the overall performance of GTC can be improved by 3 to 4 times.
Keywords
computer graphic equipment; coprocessors; parallel machines; performance evaluation; GPU; GPU memory access; GTC; Gyrokinetic Torodial Code optimisation; TH-1A supercomputer; array declaration; data transfer; optimization strategies; performance analysis; Algorithms; Arrays; Graphics processing unit; Multicore processing; Optimization; Supercomputers; GPU; GTC; TH-1A; high performance computing; nuclear fusion;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Control Engineering (ICECE), 2011 International Conference on
Conference_Location
Yichang
Print_ISBN
978-1-4244-8162-0
Type
conf
DOI
10.1109/ICECENG.2011.6057053
Filename
6057053