• DocumentCode
    1973372
  • Title

    Performance analysis and optimization of Gyrokinetic Torodial Code on TH-1A supercomputer

  • Author

    Zhu, Xiaoqian ; Liu, Xin ; Meng, Xiangfei ; Feng, Jinghua

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2011
  • fDate
    16-18 Sept. 2011
  • Firstpage
    6027
  • Lastpage
    6031
  • Abstract
    In this study, we test and analyze the performance of the Gyrokinetic Toroidal Code (GTC) program. Based on the analysis results, we port GTC's compute-intensive subroutines to the GPU and accelerate them on the "CPU+GPU" heterogeneous architecture of the TH-1A supercomputer. Several optimization strategies are developed in this process: subroutines are integrated to reduce data transfer between host and device, GPU memory access is optimized to reduce access latency, and the static keyword is placed before array declarations to avoid unnecessary address allocation and data copying. Experimental results show that the subroutines ported to the GPU run 6 to 8 times faster, and that the overall performance of GTC could be improved by a factor of 3 to 4.
  • Keywords
    computer graphic equipment; coprocessors; parallel machines; performance evaluation; GPU; GPU memory access; GTC; Gyrokinetic Torodial Code optimisation; TH-1A supercomputer; array declaration; data transfer; optimization strategies; performance analysis; Algorithms; Arrays; Graphics processing unit; Multicore processing; Optimization; Supercomputers; TH-1A; high performance computing; nuclear fusion
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electrical and Control Engineering (ICECE), 2011 International Conference on
  • Conference_Location
    Yichang
  • Print_ISBN
    978-1-4244-8162-0
  • Type

    conf

  • DOI
    10.1109/ICECENG.2011.6057053
  • Filename
    6057053