DocumentCode
2820972
Title
Consistent state restoration in shared memory systems
Author
Baldoni, Roberto ; Helary, J.-M. ; Mostefaoui, Achour ; Raynal, Michel
Author_Institution
IRISA, Rennes, France
fYear
1997
fDate
19-21 Mar 1997
Firstpage
330
Lastpage
337
Abstract
In many systems, backward recovery constitutes a classical technique to ensure fault-tolerance. It consists in restoring a computation in a consistent global state, saved in a global checkpoint, from which this computation can be resumed. A global checkpoint includes a set of local checkpoints, one from each process which correspond to local states dumped onto stable storage. In this paper we are interested in defining formally the domino effect for shared memory systems be the shared memory a physical one (as in multiprocessor systems) or a virtual one (as in distributed shared memory systems) and in designing a domino-free adaptive algorithm. These results lie on a necessary and sufficient condition which shows when a set of local checkpoints can belong to some consistent global checkpoint
Keywords
shared memory systems; system recovery; backward recovery; consistent global state; domino-free adaptive algorithm; fault-tolerance; global checkpoint; shared memory systems; state restoration; Adaptive algorithm; Algorithm design and analysis; Checkpointing; Context modeling; Distributed computing; Fault tolerant systems; Kernel; Multiprocessing systems; Parallel machines; Protocols;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Parallel and Distributed Computing, 1997. Proceedings
Conference_Location
Shanghai
Print_ISBN
0-8186-7876-3
Type
conf
DOI
10.1109/APDC.1997.574051
Filename
574051
Link To Document