• DocumentCode
    169063
  • Title

    HPCG: Preliminary Evaluation and Optimization on Tianhe-2 CPU-only Nodes

  • Author

    Cheng Chen ; Yunfei Du ; Hao Jiang ; Ke Zuo ; Canqun Yang

  • Author_Institution
    Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    41
  • Lastpage
    48
  • Abstract
    HPCG has become a new metric for the design and ranking of HPC. By incorporating a local symmetric Gauss-Seidel preconditioned, HPCG implements the Conjugate Gradient method to solve a sparse linear system. HPCG performs poorly with irregular memory access and may consume a great deal of MPI resources when it is executed on supercomputers. This paper focuses on optimizing SpMV and the Gauss-Seidel preconditioned, the two most important kernels in HPCG. By evaluating the performance impacts of several representative sparse matrix formats, ELLPACK is selected due to its suitability for SIMD, resulting in a speedup of 2.3x for the SpMV kernel. Multi-coloring is performed for Gauss-Seidel, resulting in a speedup of 7.3x over the reference implementation. The CG convergence rate may also be improved after multi-coloring. Our experimental results show that our optimization process works well on supercomputers, achieving 6.5 Gflops on a CPU-only node. This has boosted the total HPCG Gflops by about 7x, giving rise to 80,151 Gflops on 8192 CPU-only Tianhe-2 nodes.
  • Keywords
    application program interfaces; conjugate gradient methods; flip-flops; iterative methods; message passing; microprocessor chips; optimisation; parallel machines; performance evaluation; sparse matrices; ELLPACK; Gauss-Seidel multicoloring; HPC ranking; HPCG Gflops; MPI resources; SIMD; SpMV kernel; Tianhe-2 CPU-only nodes; conjugate gradient method; local symmetric Gauss-Seidel preconditioner; memory access; optimization process; performance evaluation; sparse linear system; sparse matrix formats; supercomputers; Arrays; Color; Convergence; Instruction sets; Linear systems; Parallel processing; Sparse matrices; Gauss-Seidel; HPCG; Tianhe-2; multi-color; sparse matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing (SBAC-PAD), 2014 IEEE 26th International Symposium on
  • Conference_Location
    Jussieu
  • ISSN
    1550-6533
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2014.10
  • Filename
    6970645