• DocumentCode
    2820972
  • Title

    Consistent state restoration in shared memory systems

  • Author

    Baldoni, Roberto ; Helary, J.-M. ; Mostefaoui, Achour ; Raynal, Michel

  • Author_Institution
    IRISA, Rennes, France
  • fYear
    1997
  • fDate
    19-21 Mar 1997
  • Firstpage
    330
  • Lastpage
    337
  • Abstract
    In many systems, backward recovery constitutes a classical technique to ensure fault-tolerance. It consists in restoring a computation in a consistent global state, saved in a global checkpoint, from which this computation can be resumed. A global checkpoint includes a set of local checkpoints, one from each process which correspond to local states dumped onto stable storage. In this paper we are interested in defining formally the domino effect for shared memory systems be the shared memory a physical one (as in multiprocessor systems) or a virtual one (as in distributed shared memory systems) and in designing a domino-free adaptive algorithm. These results lie on a necessary and sufficient condition which shows when a set of local checkpoints can belong to some consistent global checkpoint
  • Keywords
    shared memory systems; system recovery; backward recovery; consistent global state; domino-free adaptive algorithm; fault-tolerance; global checkpoint; shared memory systems; state restoration; Adaptive algorithm; Algorithm design and analysis; Checkpointing; Context modeling; Distributed computing; Fault tolerant systems; Kernel; Multiprocessing systems; Parallel machines; Protocols;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Parallel and Distributed Computing, 1997. Proceedings
  • Conference_Location
    Shanghai
  • Print_ISBN
    0-8186-7876-3
  • Type

    conf

  • DOI
    10.1109/APDC.1997.574051
  • Filename
    574051