DocumentCode :
2788017
Title :
An optimistic checkpointing and selective message logging approach for consistent global checkpoint collection in distributed systems
Author :
Jiang, Qiangfeng ; Manivannan, D.
Author_Institution :
Dept. of Comput. Sci., Kentucky Univ., Lexington, KY
fYear :
2007
fDate :
26-30 March 2007
Firstpage :
1
Lastpage :
10
Abstract :
In this paper, we present an asynchronous consistent global checkpoint collection algorithm which prevents contention for network storage at the file server and hence reduces the checkpointing overhead. The algorithm has two phases: In the first phase, a process initiates consistent global checkpoint collection by saving its state tentatively and asynchronously (called tentative checkpoint) in local memory or remote stable storage if there is no contention for stable storage while saving the state; in the second phase, the message log associated with the tentative checkpoint is stored in stable storage (checkpoint finalization phase). The tentative checkpoint together with the associated message log stored in the stable storage becomes part of a consistent global checkpoint. Under our algorithm, two or more processes can concurrently initiate consistent global checkpoint collection. Every tentative checkpoint will be finalized successfully unless a failure occurs. The finalized checkpoints of each process is assigned a unique sequence number in ascending order. Finalized checkpoints with same sequence number form a consistent global checkpoint.
Keywords :
checkpointing; message passing; distributed system; network storage; optimistic checkpointing; selective message logging; tentative checkpoint; Algorithm design and analysis; Checkpointing; Computer science; Control systems; Degradation; Delay; File servers; System performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
Type :
conf
DOI :
10.1109/IPDPS.2007.370308
Filename :
4228036
Link To Document :
بازگشت