• DocumentCode
    1685220
  • Title

    Systematic assessment of the overhead of tracing parallel programs

  • Author

    Fagot, Alain ; de Kergommeaux, Jacques Chassin

  • Author_Institution
    APACHE Project, IMAG, Grenoble, France
  • fYear
    1996
  • Firstpage
    179
  • Lastpage
    186
  • Abstract
    Instant replay is a classical technique developed to help programmers to cope with transient errors occurring in non-deterministic executions of parallel programs. Enough information is recorded during an initial recording phase to be able to force subsequent re-executions to be deterministic with respect to the initial one. If the time overhead of the initial recording is sufficiently low, recording can be used as a normal execution mode of parallel programs. This article describes the method used to assess systematically the overhead of the recording phase of an instant replay tool implemented for a remote procedure call (RPC) based programming model named ATHAPASCAN. Evaluation was done using synthetic programs generated from program models of classical parallel algorithms. The generality of the method comes from the possibility of generating multiple program instances from a single algorithm model. Experimental results confirm the efficiency of the tested execution replay tool
  • Keywords
    parallel algorithms; parallel programming; program debugging; program diagnostics; recording; remote procedure calls; software tools; ATHAPASCAN; RPC-based programming model; deterministic reexecutions; efficiency; execution replay tool; generality; instant replay; multiple program instances generation; nondeterministic executions; overhead assessment; parallel algorithm model; parallel program tracing; recording phase; remote procedure call; software tool; synthetic programs; time overhead; transient errors; Control systems; Debugging; Load management; Operating systems; Parallel algorithms; Parallel programming; Pathology; Programming profession; Runtime; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1996. PDP '96. Proceedings of the Fourth Euromicro Workshop on
  • Conference_Location
    Braga
  • Print_ISBN
    0-8186-7376-1
  • Type

    conf

  • DOI
    10.1109/EMPDP.1996.500585
  • Filename
    500585