• DocumentCode
    3244782
  • Title

    EROS: an efficient method for minimizing the replay time based on the replay dependence relation

  • Author

    Thoai, Nam ; Kranzlmüller, Dieter ; Volkert, Jens

  • Author_Institution
    Johannes Kepler Univ., Linz, Austria
  • fYear
    2003
  • fDate
    5-7 Feb. 2003
  • Firstpage
    23
  • Lastpage
    30
  • Abstract
    Debugging parallel programs is still a challenge although many debugging techniques and corresponding tools have been developed. One reason why parallel programs are difficult to debug is the irreproducibility effect, which is caused by nondeterminism occurring at process interaction. Solutions to this problem are provided by so-called record and replay mechanisms or replay techniques. However, the rather long waiting time during replay often prohibits inclusion of these techniques to debugging tools, which must provide some degree of interactivity for user´s investigations. This paper analyzes the possibility of combining debugging with checkpointing to shorten waiting time during re-executions. Related work in this area either cannot ensure a short waiting time or creates other effects on the autonomy of processes. The EROS approach introduces the replay dependence relation, which allows us to minimize the waiting time without the restrictions above. In fact, EROS provides a small upper bound of the replay time during debugging nondeterministic message passing programs.
  • Keywords
    message passing; parallel programming; program debugging; system recovery; EROS; checkpointing; debugging; interactivity; irreproducibility effect; nondeterministic message passing programs; parallel programs; record and replay mechanisms; replay dependence relation; replay time minimization; upper bound; Checkpointing; Computer bugs; Debugging; Europe; Fault tolerance; Large-scale systems; Message passing; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing, 2003. Proceedings. Eleventh Euromicro Conference on
  • Conference_Location
    Genova, Italy
  • ISSN
    1066-6192
  • Print_ISBN
    0-7695-1875-3
  • Type

    conf

  • DOI
    10.1109/EMPDP.2003.1183561
  • Filename
    1183561