Title :
Optimal checkpointing and rollback strategies with media failures: statistical estimation algorithms
Author :
Dohi, Tadashi ; Kaio, Naoto ; Osaka, S.
Author_Institution :
Dept. of Ind. & Syst. Eng., Hiroshima Univ., Japan
Abstract :
This paper considers two stochastic models for a file recovery action with checkpoint generations when two kinds of failures; system failure and media failure, occur according to a homogeneous Poisson process and a renewal process, respectively. For the unknown media failure time distribution, we develop statistical nonparametric algorithms to estimate the optimal checkpoint intervals which maximize the system availabilities. The algorithms proposed are based on the corresponding total time on test (TTT) statistics to the media failure time distribution, and can provide strongly consistent estimates from its sample data
Keywords :
nonparametric statistics; software fault tolerance; statistical analysis; stochastic processes; system recovery; checkpoint generations; file recovery; homogeneous Poisson process; optimal checkpointing; rollback strategies; statistical estimation algorithms; statistical nonparametric algorithms; stochastic models; system failure; total time on test; Checkpointing; Statistical analysis; Statistical distributions; Stochastic systems; Testing;
Conference_Titel :
Dependable Computing, 1999. Proceedings. 1999 Pacific Rim International Symposium on
Print_ISBN :
0-7695-0371-3
DOI :
10.1109/PRDC.1999.816225