DocumentCode
2539732
Title
A new look at atomic broadcast in the asynchronous crash-recovery model
Author
Mena, Sergio E. ; Schiper, André
Author_Institution
Ecole Polytech. Fed. de Lausanne, Switzerland
fYear
2005
fDate
26-28 Oct. 2005
Firstpage
202
Lastpage
211
Abstract
Atomic broadcast in particular, and group communication in general, have mainly been specified and implemented in a system model where processes do not recover after a crash. The model is called crash-stop. The drawback of this model is its inability to express algorithms that tolerate the crash of a majority of processes. This has led to extend the crash-stop model to the so-called crash-recovery model, in which processes have access to stable storage, to log their state periodically. This allows them to recover a previous state after a crash. However, the existing specifications of atomic broadcast in the crash-recovery model are not satisfactory, and the paper explains why. The paper also proposes a new specification of atomic broadcast in the crash-recovery model that addresses these issues. Specifically, our new specification allows to distinguish between a uniform and a non-uniform version of atomic broadcast. The non-uniform version logs less information, and is thus more efficient. The uniform and non-uniform atomic broadcast have been implemented and compared with a published atomic broadcast algorithm. Performance results are presented.
Keywords
broadcasting; fault tolerant computing; message passing; multiprocessing systems; system recovery; asynchronous crash-recovery model; atomic broadcast specification; crash-stop model; group communication; Broadcasting; Computer crashes; Context modeling; Database systems; Detectors; Distributed computing; Electronic mail; Fault tolerance;
fLanguage
English
Publisher
ieee
Conference_Titel
Reliable Distributed Systems, 2005. SRDS 2005. 24th IEEE Symposium on
Print_ISBN
0-7695-2463-X
Type
conf
DOI
10.1109/RELDIS.2005.6
Filename
1541197
Link To Document