• DocumentCode
    3223265
  • Title

    Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles

  • Author

    Tallent, Nathan R. ; Adhianto, Laksono ; Mellor-Crummey, John M.

  • Author_Institution
    Rice Univ., Houston, TX, USA
  • fYear
    2010
  • fDate
    13-19 Nov. 2010
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Applications must scale well to make efficient use of today's class of petascale computers, which contain hundreds of thousands of processor cores. Inefficiencies that do not even appear in modest-scale executions can become major bottlenecks in large-scale executions. Because scaling problems are often difficult to diagnose, there is a critical need for scalable tools that guide scientists to the root causes of scaling problems. Load imbalance is one of the most common scaling problems. To provide actionable insight into load imbalance, we present post-mortem parallel analysis techniques for pinpointing and quantifying load imbalance in the context of call path profiles of parallel programs. We show how to identify load imbalance in its static and dynamic context by using only low-overhead asynchronous call path profiling to locate regions of code responsible for communication wait time in SPMD executions. We describe the implementation of these techniques within HPCTOOLKIT.
  • Keywords
    parallel programming; power aware computing; HPCTOOLKIT; SPMD executions; load imbalance; low overhead asynchronous call path profiling; parallel programs; petascale computers; post mortem parallel analysis techniques; processor cores; scalable identification; Context; Databases; Equations; Instruction sets; Instruments; Synchronization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4244-7557-5
  • Electronic_ISBN
    978-1-4244-7558-2
  • Type

    conf

  • DOI
    10.1109/SC.2010.47
  • Filename
    5644887