Title :
Fault-tolerant distributed deadlock detection/resolution
Author :
Li, Pei-yu ; McMillin, Bruce
Author_Institution :
Dept. of Comput. Sci., Missouri Univ., Rolla, MO, USA
Abstract :
In an unreliable distributed system, faulty processors may prevent a deadlock detection algorithm from properly detecting deadlocks. However, few of the algorithms proposed in the literature address the issue of handling process failures in a distributed system. This paper proposes a fault-tolerant distributed deadlock detection algorithm which integrates a priority-based probe algorithm with a PMC-based diagnosis model. Under a bounded number of failures in a deadlock cycle, the algorithm both detects deadlock cycles and identifies process failures by using extended probe messages that contain additional information about faulty processors.
Keywords :
distributed processing; fault tolerant computing; system recovery; PMC-based diagnosis model; deadlock cycles; deadlock detection algorithm; extended probe messages; fault-tolerant distributed deadlock detection algorithm; fault-tolerant distributed deadlock resolution; faulty processors; priority-based probe algorithm; process failures; unreliable distributed system; Computer science; Detection algorithms; Detectors; Fault detection; Fault diagnosis; Fault tolerance; Graphical models; Probes; Resource management; System recovery
Conference_Title :
Computer Software and Applications Conference, 1993. COMPSAC 93. Proceedings., Seventeenth Annual International
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-8186-4440-0
DOI :
10.1109/CMPSAC.1993.404188