DocumentCode
1699064
Title
An index-based checkpointing algorithm for autonomous distributed systems
Author
Baldoni, Roberto ; Quaglia, Francesco ; Fornara, Paolo
Author_Institution
Dipt. di Inf. e Sistemistica, Rome Univ., Italy
fYear
1997
Firstpage
27
Lastpage
34
Abstract
The paper presents an index-based checkpointing algorithm for distributed systems that aims to reduce the total number of checkpoints while ensuring that each checkpoint belongs to at least one consistent global checkpoint (or recovery line). The algorithm is based on an equivalence relation defined between pairs of successive checkpoints of a process, which in some cases allows the recovery line of the computation to advance without forcing checkpoints in other processes. The protocol shows good performance, especially in autonomous environments, where each process has no private information about other processes.
Keywords
distributed processing; fault tolerant computing; reliability; software fault tolerance; system recovery; autonomous distributed systems; autonomous environments; consistent global checkpoint; equivalence relation; index based checkpointing algorithm; protocol; recovery line; successive checkpoints; Algorithm design and analysis; Checkpointing; Communication system control; Contracts; Distributed computing; Fault tolerant systems; Force control; Process design; Protocols; Remuneration;
fLanguage
English
Publisher
IEEE
Conference_Titel
Reliable Distributed Systems, 1997. Proceedings., The Sixteenth Symposium on
Conference_Location
Durham, NC
ISSN
1060-9857
Print_ISBN
0-8186-8177-2
Type
conf
DOI
10.1109/RELDIS.1997.632793
Filename
632793