DocumentCode
2932836
Title
Improvements and Reconsideration of Distributed Snapshot Protocols
Author
Agbaria, Adnan
Author_Institution
IBM Haifa Res. Lab
fYear
2006
fDate
2-4 Oct. 2006
Firstpage
155
Lastpage
164
Abstract
Distributed snapshots are an important building block for distributed systems and, among other applications, are useful for constructing efficient checkpointing protocols. In addition to the overhead they impose, existing distributed snapshot protocols are not trivially applicable (if at all) in many of today's distributed systems, e.g., grid, mobile, and sensor systems. After presenting the shortcomings and the inapplicability of the most popular existing distributed snapshot protocols, this paper discusses improvement directions for the protocols. In addition, it presents a new and important improvement for the most popular distributed snapshot protocol, which was presented by Chandy and Lamport in 1985. Although the proposed improvement is simple and easy to implement, it has significant benefits in reducing the software and hardware overheads of distributed snapshots. Then, the paper presents proofs for the safety and progress of the new protocol. Lastly, it presents a performance analysis of the protocol using stochastic models.
Keywords
checkpointing; distributed processing; performance evaluation; protocols; checkpointing protocol; distributed snapshot protocol; distributed system; protocol performance analysis; stochastic model; Access protocols; Checkpointing; Control systems; Grid computing; Hardware; Performance analysis; Safety; Sensor systems; Software performance; Wireless application protocol;
fLanguage
English
Publisher
IEEE
Conference_Titel
Reliable Distributed Systems, 2006. SRDS '06. 25th IEEE Symposium on
Conference_Location
Leeds
ISSN
1060-9857
Print_ISBN
0-7695-2677-2
Type
conf
DOI
10.1109/SRDS.2006.26
Filename
4032477