DocumentCode
3244782
Title
EROS: an efficient method for minimizing the replay time based on the replay dependence relation
Author
Thoai, Nam ; Kranzlmüller, Dieter ; Volkert, Jens
Author_Institution
Johannes Kepler Univ., Linz, Austria
fYear
2003
fDate
5-7 Feb. 2003
Firstpage
23
Lastpage
30
Abstract
Debugging parallel programs is still a challenge, although many debugging techniques and corresponding tools have been developed. One reason why parallel programs are difficult to debug is the irreproducibility effect, which is caused by nondeterminism occurring at process interaction. Solutions to this problem are provided by so-called record and replay mechanisms, or replay techniques. However, the rather long waiting time during replay often prohibits the inclusion of these techniques in debugging tools, which must provide some degree of interactivity for the user's investigations. This paper analyzes the possibility of combining debugging with checkpointing to shorten the waiting time during re-executions. Related work in this area either cannot ensure a short waiting time or affects the autonomy of processes. The EROS approach introduces the replay dependence relation, which allows us to minimize the waiting time without the restrictions above. In fact, EROS provides a small upper bound on the replay time when debugging nondeterministic message passing programs.
Keywords
message passing; parallel programming; program debugging; system recovery; EROS; checkpointing; debugging; interactivity; irreproducibility effect; nondeterministic message passing programs; parallel programs; record and replay mechanisms; replay dependence relation; replay time minimization; upper bound; Checkpointing; Computer bugs; Debugging; Europe; Fault tolerance; Large-scale systems; Message passing; Upper bound
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel, Distributed and Network-Based Processing, 2003. Proceedings. Eleventh Euromicro Conference on
Conference_Location
Genova, Italy
ISSN
1066-6192
Print_ISBN
0-7695-1875-3
Type
conf
DOI
10.1109/EMPDP.2003.1183561
Filename
1183561