Title :
Combining periodic and probabilistic checkpointing in optimistic simulation
Author :
Quaglia, Francesco
Author_Institution :
Dipt. di Inf. e Sistemistica, Rome Univ., Italy
Abstract :
This paper presents a checkpointing scheme for optimistic simulation that combines periodic and probabilistic checkpointing. The probabilistic part, based on statistical data collected during the simulation, aims to record as checkpoints those states of a logical process that have a high probability of being restored due to rollback, so that those states are immediately available. The periodic part prevents performance degradation due to the cost of state reconstruction (coasting forward) whenever the collected statistics do not allow states that are highly likely to be restored to be identified. Hence, this scheme can be seen as a highly general solution to the checkpointing problem in optimistic simulation. A performance comparison with previous solutions is carried out through a simulation study of a store-and-forward communication network with a two-dimensional torus topology.
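The mixed decision rule described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the class name, the threshold test, the `max_interval` parameter, and the way the restore-probability estimate is supplied are all assumptions made for illustration, since the abstract does not specify how the statistics are turned into a decision.

```python
from dataclasses import dataclass


@dataclass
class MixedCheckpointPolicy:
    """Hypothetical policy; names and parameters are illustrative, not from the paper."""
    max_interval: int      # periodic part: max states skipped between two checkpoints
    prob_threshold: float  # probabilistic part: checkpoint when estimate >= threshold
    events_since_checkpoint: int = 0

    def should_checkpoint(self, restore_probability: float) -> bool:
        """Decide whether to save the current state of a logical process.

        restore_probability is an externally supplied estimate (assumed to be
        derived from statistics collected during the simulation) that this
        state will have to be restored because of a rollback.
        """
        probabilistic = restore_probability >= self.prob_threshold
        periodic = self.events_since_checkpoint >= self.max_interval
        if probabilistic or periodic:
            # Saving the state bounds the coasting-forward cost of later rollbacks.
            self.events_since_checkpoint = 0
            return True
        self.events_since_checkpoint += 1
        return False


# Usage: one likely-to-be-restored state, then a run of uninformative estimates.
policy = MixedCheckpointPolicy(max_interval=3, prob_threshold=0.8)
estimates = [0.1, 0.9, 0.2, 0.1, 0.3, 0.2, 0.1]
decisions = [policy.should_checkpoint(p) for p in estimates]
print(decisions)
```

In this sketch the second state is saved because its estimate exceeds the threshold, and the sixth is saved by the periodic fallback after three states have been skipped, which is the situation the abstract describes when the statistics do not single out any state.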
Keywords :
distributed processing; software fault tolerance; software performance evaluation; system recovery; time warp simulation; logical process; optimistic simulation; performance degradation; periodic checkpointing; probabilistic checkpointing; rollback; simulation; state reconstruction; statistical data; store-and-forward communication network; two-dimensional torus topology; Checkpointing; Costs; Discrete event simulation; Processor scheduling; Protocols; Remuneration; Statistics; Time warp simulation
Conference_Title :
Parallel and Distributed Simulation, 1999. Proceedings. Thirteenth Workshop on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7695-0155-9
DOI :
10.1109/PADS.1999.766167