DocumentCode
3469049
Title
Reliable probabilistic checkpointing
Author
Nam, Hyo-Chang ; Kim, Jong ; Hong, Sungje ; Lee, Sunggu
Author_Institution
Dept. of Comput. Sci. & Eng., Pohang Inst. of Sci. & Technol., South Korea
fYear
1999
fDate
1999
Firstpage
153
Lastpage
160
Abstract
Recently proposed probabilistic checkpointing has one drawback, naming aliasing. When analyzed, 64-bit signatures show negligible possibility of aliasing. But in practice, the shift-XOR signature generation function used with probabilistic checkpointing shows a high aliasing rate, which limits the practicality of probabilistic checkpointing. In this paper, two enhancements are considered to make probabilistic checkpointing more reliable. One is the signature generation function and the other is the recovery scheme. In the signature generation function part, we propose two signature generation functions: HALF for small block sizes (less than or equal to 256 bytes) and C-HALF(CRC combined HALF) for large block sizes (larger than 256 bytes), which have an aliasing probability similar to analytic results and small overhead. In the recovery scheme part, we propose a recovery scheme which ensures the safety of probabilistic checkpointing. To examine the correctness of previous checkpoints at recovery time, the proposed recovery scheme uses a spare node. We analyze the recovery scheme using a mathematical model. Also an optimal checkpoint interval is derived using the model
Keywords
probability; software fault tolerance; system recovery; aliasing; checkpoints; mathematical model; reliable probabilistic checkpointing; shift-XOR signature generation function; system recovery scheme; Checkpointing; Computer science; Costs; Electronic switching systems; Failure analysis; Fault tolerance; Mathematical model; Protection; Reliability engineering; Safety;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Computing, 1999. Proceedings. 1999 Pacific Rim International Symposium on
Print_ISBN
0-7695-0371-3
Type
conf
DOI
10.1109/PRDC.1999.816224
Filename
816224
Link To Document