DocumentCode
3280433
Title
Fault tolerant distributed shared memory algorithms
Author
Stumm, Michael ; Zhou, Songnian
Author_Institution
Toronto Univ., Ont., Canada
fYear
1990
fDate
9-13 Dec 1990
Firstpage
719
Lastpage
724
Abstract
Distributed shared memory (DSM) has received increased attention as a mechanism for interprocess communication in loosely-coupled distributed systems because of its perceived advantages over direct use of message passing or remote procedure calls. One problem with most DSM algorithms proposed to date, however, is that they do not tolerate faults. The paper extends four basic DSM algorithms to tolerate single host failures and argues that this degree of fault tolerance is sufficient for most applications. It analyzes the performance behavior of the fault tolerant DSM algorithms and shows that for some algorithms the additional overhead for fault tolerance is quite small, but that for other algorithms the extra overhead can be substantial and even unpredictable.
Keywords
distributed processing; fault tolerant computing; performance evaluation; distributed shared memory algorithms; fault tolerance; interprocess communication; loosely-coupled distributed systems; performance behavior; single host failures; Algorithm design and analysis; Application software; Clustering algorithms; Costs; Fault tolerance; Fault tolerant systems; Message passing; Performance analysis; Read-write memory; Workstations
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing, 1990. Proceedings of the Second IEEE Symposium on
Conference_Location
Dallas, TX
Print_ISBN
0-8186-2087-0
Type
conf
DOI
10.1109/SPDP.1990.143633
Filename
143633