• DocumentCode
    2736773
  • Title

    Trace-driven debugging of message passing programs

  • Author

    Frumkin, Michael ; Hood, Robert ; Lopez, Louis

  • Author_Institution
    NAS Syst. Div., NASA Ames Res. Center, Moffett Field, CA, USA
  • fYear
    1998
  • fDate
    30 Mar-3 Apr 1998
  • Firstpage
    753
  • Lastpage
    762
  • Abstract
    We report on features added to a parallel debugger to simplify the debugging of message passing programs. These features include replay, setting consistent breakpoints based on interprocess event causality, a parallel undo operation, and communication supervision. These features all use trace information collected during the execution of the program being debugged. We used a number of different instrumentation techniques to collect traces. We also implemented trace displays using two different trace visualization systems. The implementation was tested on an SGI Power Challenge cluster and a network of SGI workstations
  • Keywords
    local area networks; message passing; parallel programming; program debugging; SGI Power Challenge cluster; SGI workstation network; communication supervision; consistent breakpoints; instrumentation techniques; interprocess event causality; message passing program debugging; parallel undo operation; replay; trace displays; trace driven debugging; trace visualization systems; Debugging; History; Instruments; Libraries; Message passing; Monitoring; NASA; Space technology; Visualization; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
  • Conference_Location
    Orlando, FL
  • ISSN
    1063-7133
  • Print_ISBN
    0-8186-8404-6
  • Type

    conf

  • DOI
    10.1109/IPPS.1998.670012
  • Filename
    670012