Title of article :
A quasi-synchronous checkpointing algorithm that prevents contention for stable storage
Author/Authors :
D. Manivannan, Q. Jiang, Jianchang Yang, M. Singhal
Issue Information :
Journal with serial number, year 2008
Abstract :
Checkpointing and rollback recovery are established techniques for handling failures in distributed systems. Under synchronous checkpointing, each process involved in the distributed computation takes a checkpoint at almost the same time. This causes contention for network stable storage and hence degrades performance, as processes may have to wait a long time for the checkpointing operation to complete. In this paper, we propose a staggered quasi-synchronous checkpointing algorithm that reduces contention for network stable storage without any synchronization overhead.
Keywords :
Communication-induced checkpointing, Uncoordinated, Fault-tolerance, Rollback recovery, Staggered checkpointing, Failure-recovery, Checkpoint staggering, Distributed checkpointing
Journal title :
Information Sciences