Title :
Optimizing Adaptive Checkpointing Schemes for Grid Workflow Systems
Author :
Xiang, Yang ; Li, Zhongwen ; Chen, Hong
Author_Institution :
Sch. of Eng. & Inf. Technol., Deakin Univ., Melbourne, Vic.
Abstract :
One of the major challenges in the wide use of grid workflow systems is fault tolerance and avoidance. Checkpointing schemes provide a means of fault detection and recovery. In our research, we focus on performance optimization of checkpointing schemes for grid workflow systems. We propose a set of adaptive checkpointing schemes that dynamically adjust the checkpointing intervals online by using store-checkpoints (SCPs) and compare-checkpoints (CCPs). These schemes make efficient use of comparison and storage operations and significantly improve performance. Further, these schemes can calculate the optimal number of checkpoints that minimizes the mean execution time. We also extend the schemes from single-task execution scenarios to multitask execution scenarios. Simulation results show that these schemes substantially increase the likelihood of timely task completion when faults occur.
Keywords :
checkpointing; fault tolerant computing; grid computing; adaptive checkpointing; compare-checkpointing; fault avoidance; fault detection; fault recovery; fault tolerance; grid workflow systems; store-checkpointing; Australia; Educational institutions; Fault tolerant systems; Frequency; Information science; Information technology; Optimization; Redundancy
Conference_Title :
Grid and Cooperative Computing Workshops, 2006. GCCW '06. Fifth International Conference on
Conference_Location :
Hunan
Print_ISBN :
0-7695-2695-0
DOI :
10.1109/GCCW.2006.69