Title :
A new, efficient coordinated checkpointing protocol combined with selective sender-based message logging
Author :
Rao, Ch D V Subba ; Naidu, M.M.
Author_Institution :
Sri Venkateswara Univ., Tirupati
Date :
March 31 2008-April 4 2008
Abstract :
Checkpointing and message logging are popular, general-purpose tools for providing fault tolerance in distributed systems. Most coordinated checkpointing algorithms in the literature do not address the treatment of lost messages, and they suffer from high output commit latency. To overcome these limitations, we propose a new coordinated checkpointing protocol combined with selective sender-based message logging. The protocol is free from the problem of lost messages. The term 'selective' implies that messages are logged only within a specified interval, known as the active interval, thereby reducing message logging overhead. All processes take checkpoints at the end of their respective active intervals, forming a consistent global state. Outside the active interval there is no checkpointing of process state. The protocol minimizes several overheads: checkpointing overhead, message logging overhead, recovery overhead, and blocking overhead. Unlike blocking coordinated checkpointing, the proposed protocol incurs less disk contention.
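The idea of selective sender-based logging can be illustrated with a small sketch. It is not the authors' protocol: the abstract does not give the coordination messages, the recovery procedure, or the log format, so the `Process` class, its method names, and the integer "state" below are all hypothetical. The sketch only shows the two properties the abstract states: a sender keeps a copy of a message only while its active interval is open, and each process checkpoints when its active interval ends.

```python
from dataclasses import dataclass, field


@dataclass
class Process:
    """Hypothetical process model; names are illustrative, not from the paper."""
    pid: int
    state: int = 0
    active: bool = False                              # inside the active interval?
    send_log: list = field(default_factory=list)      # sender-side message log
    checkpoints: list = field(default_factory=list)   # saved states

    def begin_active_interval(self):
        self.active = True

    def send(self, dest, payload):
        # Selective logging: keep a copy only while the interval is active.
        if self.active:
            self.send_log.append((dest.pid, payload))
        dest.receive(payload)

    def receive(self, payload):
        # Toy state update (assumption): the state is a running sum.
        self.state += payload

    def end_active_interval(self):
        # Checkpoint at the end of the interval, then stop logging.
        self.checkpoints.append(self.state)
        self.active = False


p, q = Process(0), Process(1)
p.send(q, 5)                 # outside the interval: not logged
for proc in (p, q):
    proc.begin_active_interval()
p.send(q, 7)                 # inside the interval: logged at the sender
for proc in (p, q):
    proc.end_active_interval()
```

In this toy run only the second message ends up in the log of `p`, and each process holds exactly one checkpoint, taken when its interval closed. How the real protocol opens and closes intervals across processes, and how the log is used during recovery, is left to the paper.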
Keywords :
checkpointing; fault tolerant computing; message passing; coordinated checkpointing protocol; distributed system; fault-tolerance; selective sender-based message logging; Checkpointing; Computer science; Counting circuits; Delay; Educational institutions; Electronic mail; Fault tolerant systems; Protocols; Resumes; Signal processing; Checkpointing; Distributed Systems; Fault Tolerance; Message Logging;
Conference_Title :
Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
Conference_Location :
Doha
Print_ISBN :
978-1-4244-1967-8
Electronic_ISBN :
978-1-4244-1968-5
DOI :
10.1109/AICCSA.2008.4493571