DocumentCode :
921392
Title :
Rollback recovery in distributed systems using loosely synchronized clocks
Author :
Tong, Zhijun ; Kain, Richard Y. ; Tsai, W.T.
Author_Institution :
Bit 3 Comput. Corp., Minneapolis, MN, USA
Volume :
3
Issue :
2
fYear :
1992
fDate :
3/1/1992 12:00:00 AM
Firstpage :
246
Lastpage :
251
Abstract :
A rollback recovery scheme for distributed systems is proposed. The state-save synchronization among processes is implemented by bounding clock drifts such that no state-save synchronization messages are required. Since the clocks are only loosely synchronized, the synchronization overhead can be negligible in many applications. An interprocess communication protocol which encodes state-save progress information within message frames is introduced to checkpoint consistent system states. A rollback recovery algorithm that will force a minimum number of nodes to roll back after failures is developed
Keywords :
distributed processing; programming theory; protocols; clock drifts; consistent system states; distributed systems; encodes; interprocess communication protocol; loosely synchronized clocks; message frames; rollback recovery algorithm; state-save progress information; state-save synchronization messages; Checkpointing; Clocks; Computer science; Concrete; Delay; Fault detection; Fault tolerant systems; Protocols; Synchronization;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.127264
Filename :
127264
Link To Document :
بازگشت