DocumentCode
2265464
Title
On checkpointing strategies in unreliable computing environments
Author
Fiorini, P.M.
Author_Institution
C&F Search Marketing, Miami, FL, USA
Volume
1
fYear
2011
fDate
15-17 Sept. 2011
Firstpage
193
Lastpage
197
Abstract
In this paper, we analyze performance implications of checkpointing strategies in unreliable computing environments. We show that if the appropriate checkpointing strategy is not chosen, the time to complete a job is heavy-tailed distributed. This can lead to highly-variable and long completion times. We generate asymptotics for job completion times when there is no checkpointing, a fixed number of random checkpoints, and when checkpoints occur at fixed intervals for various task time distributions. Our asymptotic results are derived using large deviation theory.
Keywords
checkpointing; reliability; ubiquitous computing; asymptotics; checkpointing strategies; deviation theory; fixed intervals; heavy tailed distributed system; job completion times; random checkpoints; task time distribution; unreliable computing environments; Checkpointing; Computational modeling; Equations; Markov processes; Mathematical model; Random variables; Tin; RESTART; asymptotics; checkpointing; failure; heavy-tail; large deviation theory; pri; recovery; unreliable systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Data Acquisition and Advanced Computing Systems (IDAACS), 2011 IEEE 6th International Conference on
Conference_Location
Prague
Print_ISBN
978-1-4577-1426-9
Type
conf
DOI
10.1109/IDAACS.2011.6072739
Filename
6072739
Link To Document