DocumentCode :
1336215
Title :
Design and analysis of an integrated checkpointing and recovery scheme for distributed applications
Author :
Ramamurthy, Bina ; Upadhyaya, Shambhu ; Bhargava, Bharat
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
Volume :
12
Issue :
2
fYear :
2000
Firstpage :
174
Lastpage :
186
Abstract :
An integrated checkpointing and recovery scheme which exploits the low latency and high coverage characteristics of a concurrent error detection scheme is presented. Message dependency, which is the main source of multistep rollback in distributed systems, is minimized by using a new message validation technique derived from the notion of concurrent error detection. The concept of a new global state matrix is introduced to track error checking and message dependency in a distributed system and assist in the recovery. The analytical model, algorithms and data structures to support an easy implementation of the new scheme are presented. The completeness and correctness of the algorithms are proved. A number of scenarios and illustrations that give the details of the analytical model are presented. The benefits of the integrated checkpointing scheme are quantified by means of simulation using an object-oriented test framework.
Keywords :
data structures; distributed algorithms; error detection; minimisation; system recovery; algorithm completeness; algorithm correctness; analytical model; checkpointing scheme; concurrent error detection scheme; coverage; data structures; distributed applications; error-checking tracking; global state matrix; integrated scheme; latency; message dependency minimization; message logging; message validation technique; multistep rollback; object-oriented test framework; recovery scheme; simulation; Analytical models; Checkpointing; Costs; Data structures; Delay; Hardware; Mission critical systems; Object oriented modeling; Redundancy; Testing
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.842261
Filename :
842261