• DocumentCode
    2484732
  • Title

    Application profiling on Cell-based clusters

  • Author

    Dursun, Hikmet ; Barker, Kevin J. ; Kerbyson, Darren J. ; Pakin, Scott

  • Author_Institution
    Performance & Archit. Lab. (PAL), Los Alamos Nat. Lab., Los Alamos, NM, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    In this paper, we present a methodology for profiling parallel applications executing on the IBM PowerXCell 8i (commonly referred to as the ldquoCellrdquo processor). Specifically, we examine Cell-centric MPI programs on hybrid clusters containing multiple Opteron and Cell processors per node such as those used in the petascale Roadrunner system. Our implementation incurs less than 3.2 mus of overhead per profile call while efficiently utilizing the limited local store of the Cell´s SPE cores. We demonstrate the use of our profiler on a cluster of hybrid nodes running a suite of scientific applications. Our analyses of inter-SPE communication (across the entire cluster) and function call patterns provide valuable information that can be used to optimize application performance.
  • Keywords
    application program interfaces; message passing; multiprocessing systems; parallel processing; workstation clusters; Cell processor; Cell-based clusters; Cell-centric MPI programs; IBM PowerXCell 8i; Opteron processors; interSPE communication; parallel application profiling; petascale Roadrunner system; scientific applications; Application software; Collaboration; Computational modeling; Computer architecture; Computer science; Computer simulation; Concurrent computing; Laboratories; Performance analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5161092
  • Filename
    5161092