DocumentCode :
1336221
Title :
An architecture for survivable coordination in large distributed systems
Author :
Malkhi, Dahlia ; Reiter, Michael K.
Author_Institution :
Sch. of Comput. Sci. & Eng., Hebrew Univ., Jerusalem, Israel
Volume :
12
Issue :
2
fYear :
2000
Firstpage :
187
Lastpage :
202
Abstract :
Coordination among processes in a distributed system can be rendered very complex in a large-scale system where messages may be delayed or lost and when processes may participate only transiently or behave arbitrarily, e.g. after suffering a security breach. In this paper, we propose a scalable architecture to support coordination in such extreme conditions. Our architecture consists of a collection of persistent data servers that implement simple shared data abstractions for clients, without trusting the clients or even the servers themselves. We show that, by interacting with these untrusted servers, clients can solve distributed consensus, a powerful and fundamental coordination primitive. Our architecture is very practical, and we describe the implementation of its main components in a system called Fleet
Keywords :
client-server systems; data structures; fault tolerant computing; reconfigurable architectures; software architecture; Byzantine failures; Fleet; arbitrary process behaviour; client-server systems; distributed consensus; large distributed systems; message delay; message loss; persistent data servers; quorums; scalable architecture; security breach; shared data abstractions; survivable coordination; transient processes; untrusted servers; Data security; Delay systems; Large-scale systems; Protocols; Public key; Publishing; Robustness; Scalability; Software systems; Voting;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.842262
Filename :
842262
Link To Document :
بازگشت