Title :
On-the-fly replay: a practical paradigm and its implementation for distributed debugging
Author :
Gerstel, O. ; Zaks, Shmuel ; Hurfin, M. ; Plouzeau, N. ; Raynal, M.
Author_Institution :
Dept. of Comput. Sci., Israel Inst. of Technol., Haifa, Israel
Abstract :
This paper presents a practical paradigm, called on-the-fly replay. This paradigm consists of running a distributed program twice at the same time: an original computation is running in a regular fashion, which also includes steps of making non-deterministic choices; this execution is driving a twin execution, whose non-deterministic choices do not have to be evaluated (since they are taken from the original computation). This paradigm has several interesting uses. Among them, distributed debugging is particularly noteworthy. The integration of this paradigm into a distributed debugging facility, called EREBUS, is described. This implementation was run on a distributed memory parallel machine (Intel Hypercube iPSC2) and experimental results are described, that demonstrate the advantage of this paradigm
Keywords :
distributed processing; program debugging; programming; EREBUS; distributed debugging; distributed debugging facility; distributed memory parallel machine; distributed program; on-the-fly replay; practical paradigm; Computational modeling; Computer science; Debugging; Delay; Distributed computing; Hardware; Hypercubes; Monitoring; Parallel machines; Probes;
Conference_Titel :
Parallel and Distributed Processing, 1994. Proceedings. Sixth IEEE Symposium on
Conference_Location :
Dallas, TX
Print_ISBN :
0-8186-6427-4
DOI :
10.1109/SPDP.1994.346158