• DocumentCode
    1871808
  • Title

    Continuous performance monitoring for large-scale parallel applications

  • Author

    Dooley, Isaac ; Lee, Chee Wai ; Kale, Laxmikant V.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois, Urbana, IL, USA
  • fYear
    2009
  • fDate
    16-19 Dec. 2009
  • Firstpage
    445
  • Lastpage
    452
  • Abstract
    Traditional performance analysis techniques are performed after a parallel program has completed. In this paper, we describe an online method for continuously monitoring the performance of a parallel program, specifically the fraction of the time spent in various activities as the program executes. Our implementation of both a visualization client and the parallel performance framework that gathers utilization data are described. The data gathering uses a scalable and asynchronous reduction with an appropriate lossless compressed data format. The overheads in the initial system are low, even when run on thousands of processors. The data gathering occurs in an out-of-band communication mechanism, interleaving itself transparently with the execution of the parallel application by leveraging a message-driven runtime system.
  • Keywords
    data visualisation; large-scale systems; parallel programming; software performance evaluation; asynchronous reduction; continuous performance monitoring; data gathering; large-scale parallel program; message-driven runtime system; online method; out-of-band communication mechanism; parallel performance framework; scalable reduction; visualization client; Application software; Computer science; Computerized monitoring; Concurrent computing; Data visualization; Degradation; Interleaved codes; Large-scale systems; Performance analysis; Programming profession;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2009 International Conference on
  • Conference_Location
    Kochi
  • Print_ISBN
    978-1-4244-4922-4
  • Electronic_ISBN
    978-1-4244-4921-7
  • Type

    conf

  • DOI
    10.1109/HIPC.2009.5433181
  • Filename
    5433181