• DocumentCode
    3469049
  • Title

    Reliable probabilistic checkpointing

  • Author

    Nam, Hyo-Chang ; Kim, Jong ; Hong, Sungje ; Lee, Sunggu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pohang Inst. of Sci. & Technol., South Korea
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    153
  • Lastpage
    160
  • Abstract
    Recently proposed probabilistic checkpointing has one drawback, naming aliasing. When analyzed, 64-bit signatures show negligible possibility of aliasing. But in practice, the shift-XOR signature generation function used with probabilistic checkpointing shows a high aliasing rate, which limits the practicality of probabilistic checkpointing. In this paper, two enhancements are considered to make probabilistic checkpointing more reliable. One is the signature generation function and the other is the recovery scheme. In the signature generation function part, we propose two signature generation functions: HALF for small block sizes (less than or equal to 256 bytes) and C-HALF(CRC combined HALF) for large block sizes (larger than 256 bytes), which have an aliasing probability similar to analytic results and small overhead. In the recovery scheme part, we propose a recovery scheme which ensures the safety of probabilistic checkpointing. To examine the correctness of previous checkpoints at recovery time, the proposed recovery scheme uses a spare node. We analyze the recovery scheme using a mathematical model. Also an optimal checkpoint interval is derived using the model
  • Keywords
    probability; software fault tolerance; system recovery; aliasing; checkpoints; mathematical model; reliable probabilistic checkpointing; shift-XOR signature generation function; system recovery scheme; Checkpointing; Computer science; Costs; Electronic switching systems; Failure analysis; Fault tolerance; Mathematical model; Protection; Reliability engineering; Safety;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Computing, 1999. Proceedings. 1999 Pacific Rim International Symposium on
  • Print_ISBN
    0-7695-0371-3
  • Type

    conf

  • DOI
    10.1109/PRDC.1999.816224
  • Filename
    816224