DocumentCode :
1940981
Title :
Dynamic snooping in a fault-tolerant distributed shared memory
Author :
Brown, Lmy ; Wu, Jie
Author_Institution :
Dept. of Comput. Sci. & Eng., Florida Atlantic Univ., Boca Raton, FL, USA
fYear :
1994
fDate :
21-24 Jun 1994
Firstpage :
218
Lastpage :
226
Abstract :
Distributed shared memory (DSM) allows multicomputer systems with no physically shared memory to be programmed using a shared memory paradigm. However, as the number of nodes in a system increases the probability of a failure that can corrupt the DSM increases. This paper presents a fault-tolerant DSM (FTDSM) algorithm that can tolerate single node failures. Each page in the DSM is assigned a snooper that keeps a backup copy of the page and can take over if the page owner fails. The snooper is dynamic because the responsibility for snooping a page can migrate front node to node. The FTDSM presented is an improvement over other FTDSMs because it is scalable, is based on the efficient dynamic distributed manager (DDM) DSM algorithm, does not require the repair of a failed processor to access the DSM, and does not query all nodes to rebuild the state of the DSM. It is shown that any single node failure can be tolerated because either the owner or the snooper of a page can always be found
Keywords :
distributed algorithms; distributed memory systems; fault tolerant computing; multiprocessing programs; reliability; shared memory systems; software reliability; transaction processing; dynamic distributed manager; dynamic snooping; fault-tolerant DSM; fault-tolerant distributed shared memory; multicomputer systems; shared memory paradigm; single node failures; Access protocols; Computer science; Control systems; Delay; Distributed computing; Distributed decision making; Fault tolerance; Multiprocessor interconnection networks; Physics computing; Programming profession;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems, 1994., Proceedings of the 14th International Conference on
Conference_Location :
Pozman
Print_ISBN :
0-8186-5840-1
Type :
conf
DOI :
10.1109/ICDCS.1994.302415
Filename :
302415
Link To Document :
بازگشت