DocumentCode
1655789
Title
Communication pattern based checkpointing coordination for fault-tolerant distributed computing systems
Author
Park, Taesoon ; Yeom, Heon Y.
Author_Institution
Dept. of Inf. Sci., Sejong Univ., Seoul, South Korea
fYear
1998
Firstpage
559
Lastpage
562
Abstract
This paper presents a new checkpointing coordination scheme that exploits the communication pattern of the cooperating processes. In the proposed scheme, checkpointing is coordinated among only a limited number of processes, based on information about the communication pattern of the target program. Unlike previous solutions, which do not use the communication pattern, the scheme can reduce both the coordination effort and the checkpointing frequency. Extensive simulation was performed to evaluate the proposed scheme, and the results indicate that it significantly reduces checkpointing overhead compared with loose coordination schemes.
Keywords
concurrency control; distributed algorithms; fault tolerant computing; checkpointing; checkpointing coordination; distributed computing; fault-tolerant; Checkpointing; Distributed computing; Fault tolerance; Fault tolerant systems; Frequency; Information science; Performance evaluation; Resumes
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Networking, 1998. (ICOIN-12) Proceedings., Twelfth International Conference on
Conference_Location
Tokyo
Print_ISBN
0-8186-7225-0
Type
conf
DOI
10.1109/ICOIN.1998.648447
Filename
648447