Title :
Keeping processes under surveillance
Author_Institution :
Dept. of Comput. Sci., Kaiserslautern Univ., Germany
fDate :
30 Sep-2 Oct 1991
Abstract :
Two solutions for the surveillance problem that are based on an election algorithm which has to cope with process and communication failures are described. The election algorithm is presented in detail. The surveillance algorithms are simple and efficient: the central crash detection protocol requires n+1 messages for each surveillance period (assuming that n is the number of processes to keep under surveillance), and the distributed approach requires n messages. If the distributed crash detection approach is used, the election algorithm has to be executed after each crash detection to determine a new ring manager which generates a new token and establishes the virtual ring. In case of a crash detection with the central protocol, a new crash detection manager has to be determined only if the old manager has failed
Keywords :
performance evaluation; protocols; central crash detection protocol; communication failures; distributed approach; distributed crash detection approach; election algorithm; process failures; ring manager; surveillance problem; token; virtual ring; Broadcasting; Computational modeling; Computer crashes; Computer science; Distributed computing; Fault tolerance; Fault tolerant systems; Nominations and elections; Protocols; Surveillance;
Conference_Titel :
Reliable Distributed Systems, 1991. Proceedings., Tenth Symposium on
Conference_Location :
Pisa
Print_ISBN :
0-8186-2260-1
DOI :
10.1109/RELDIS.1991.145424