Title :
Re-execution of distributed programs to detect bugs hidden by racing messages
Author :
Kilgore, Richard ; Chase, Craig
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Abstract :
Finding errors in non-deterministic programs is complicated by the fact that an anomaly may occur during one program execution and not the next. Our objective is to provide a practical, yet powerful testing environment for distributed systems, using re-execution. We focus on re-executing the program under a strictly different message ordering. We show that messages are grouped into waves, such that any two messages from different waves must always be received in the same order. We provide an algorithm that produces a re-execution that maximizes the number of reordered pairs of message delivery events. We also provide an efficient online algorithm for detecting racing messages
Keywords :
distributed algorithms; hazards and race conditions; program debugging; program testing; distributed program re-execution; hidden bug detection; message delivery events; message ordering; message waves; non-deterministic programs; online algorithm; program error finding; program execution anomalies; racing message detection; reordered pairs; software testing environment; Application software; Computer bugs; Debugging; Differential equations; Distributed computing; Power engineering and energy; Power engineering computing; Programming profession; Software testing; System testing;
Conference_Titel :
System Sciences, 1997, Proceedings of the Thirtieth Hawaii International Conference on
Conference_Location :
Wailea, HI
Print_ISBN :
0-8186-7743-0
DOI :
10.1109/HICSS.1997.667295