• DocumentCode
    760749
  • Title

    Design of new roll-forward recovery approach for distributed systems

  • Author

    Gupta, B. ; Banerjee, S.K. ; Liu, B.

  • Author_Institution
    Dept. of Comput. Sci., Southern Illinois Univ., Carbondale, IL, USA
  • Volume
    149
  • Issue
    3
  • fYear
    2002
  • fDate
    5/1/2002 12:00:00 AM
  • Firstpage
    105
  • Lastpage
    112
  • Abstract
    A new roll-forward checkpointing scheme is proposed using basic checkpoints. The direct-dependency concept used in the communication-induced checkpointing scheme is applied to basic checkpoints to design a simple algorithm to find a consistent global checkpoint. Both blocking (i.e. when the application processes are suspended during the execution of the algorithm) and non-blocking approaches are presented. The use of the concept of forced checkpoints ensures a small re-execution time after recovery from a failure. The proposed approaches enjoy the main advantages of both the synchronous and the asynchronous approaches, i.e. simple recovery and simple way to create checkpoints. Besides, in the proposed blocking approach, the direct-dependency concept is implemented without piggybacking any extra information with the application message. A very simple scheme for avoiding the creation of useless checkpoints is also proposed
  • Keywords
    system recovery; checkpointing scheme; communication-induced checkpointing scheme; distributed systems; roll-forward recovery approach;
  • fLanguage
    English
  • Journal_Title
    Computers and Digital Techniques, IEE Proceedings -
  • Publisher
    iet
  • ISSN
    1350-2387
  • Type

    jour

  • DOI
    10.1049/ip-cdt:20020410
  • Filename
    1008830