DocumentCode :
1655789
Title :
Communication pattern based checkpointing coordination for fault-tolerant distributed computing systems
Author :
Park, Taesoon ; Yeom, Heon Y.
Author_Institution :
Dept. of Inf. Sci., Sejong Univ., Seoul, South Korea
fYear :
1998
Firstpage :
559
Lastpage :
562
Abstract :
This paper presents a new checkpointing coordination scheme which utilizes the communication pattern of the cooperating processes. In the proposed scheme, the checkpointing is coordinated for the limited number of processes based on the information regarding the communication pattern of the target program. Unlike the previous solutions which do not utilize the communication pattern, it is possible to reduce the coordination effort as well as the checkpointing frequency. Extensive simulation has been performed to evaluate the performance of the proposed scheme and we concluded that the proposed scheme significantly reduces the checkpointing overhead compared with the loose coordination schemes
Keywords :
concurrency control; distributed algorithms; fault tolerant computing; checkpointing; checkpointing coordination; distributed computing; fault-tolerant; Checkpointing; Distributed computing; Fault tolerance; Fault tolerant systems; Frequency; Information science; Performance evaluation; Resumes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Networking, 1998. (ICOIN-12) Proceedings., Twelfth International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-8186-7225-0
Type :
conf
DOI :
10.1109/ICOIN.1998.648447
Filename :
648447
Link To Document :
بازگشت