• DocumentCode
    3146647
  • Title

    Hierarchical Mapping for HPC Applications

  • Author

    Chung, I-Hsin ; Lee, Che-Rung ; Jiazheng Zhou ; Chung, Zhou Yeh-Ching

  • Author_Institution
    T.J. Watson Res. Center, IBM, Yorktown Heights, NY, USA
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    1815
  • Lastpage
    1823
  • Abstract
    As the high performance computing systems scale up, mapping the tasks of a parallel application onto physical processors to allow efficient communication becomes one of the critical performance issues. Existing algorithms were usually designed to map applications with regular communication patterns. Their mapping criterion usually overlooks the size of communicated messages, which is the primary factor of communication time. In addition, most of their time complexities are too high to process large scale problems. In this paper, we present a hierarchical mapping algorithm (HMA), which is capable of mapping applications with irregular communication patterns. It first partitions tasks according to their run-time communication information. The tasks that communicate with each others more frequently are regarded as strongly connected. Based on their connectivity strength, the tasks are partitioned into super nodes based on the algorithms in spectral graph theory. The hierarchical partitioning reduces the mapping algorithm complexity to achieve scalability. Finally, the run-time communication information will be used again in fine tuning to explore better mappings. With the experiments, we show how the mapping algorithm helps to reduce the point-to-point communication time for the PDGEMM, a ScaLAPACK matrix multiplication computation kernel, up to 20% and the AMG2006, a tier 1 application of the Sequoia benchmark, up to 7%.
  • Keywords
    computational complexity; graph theory; parallel processing; HPC applications; PDGEMM; ScaLAPACK matrix multiplication computation kernel; Sequoia benchmark; communication patterns; connectivity strength; hierarchical mapping algorithm; mapping algorithm complexity; point-to-point communication time; runtime communication information; spectral graph theory; Algorithm design and analysis; Eigenvalues and eigenfunctions; Optimization; Partitioning algorithms; Program processors; Topology; Tuning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.340
  • Filename
    6009050