Title :
An optimal algorithm for distributed system level diagnosis
Author :
Bagchi, Anindo ; Hakimi, S.L.
Author_Institution :
Bellcore, Red Bank, NJ, USA
Abstract :
A system consisting of n identical processors connected by links in which some processors could be faulty is considered. Initially each unit knows only its own i.d. and the i.d.´s of its immediate neighbors; no unit has any global knowledge about the system. An optimal algorithm for system level diagnosis in such a system that is based on the transmission of packets by fault-free units is presented. The algorithm requires at most 3n log p+O(n+pt) message transmissions by fault-free units, where p fault-free units simultaneously start the algorithm and there are t faulty units. The correctness of the algorithm is argued.<>
Keywords :
distributed processing; fault tolerant computing; distributed system level diagnosis; fault tolerant computing; fault-free units; optimal algorithm; transmission of packets; Computer crashes; Distributed computing; Fault diagnosis; Graph theory; Multiprocessor interconnection networks; Performance evaluation; Processor scheduling; Switches; Terminology; Testing;
Conference_Titel :
Fault-Tolerant Computing, 1991. FTCS-21. Digest of Papers., Twenty-First International Symposium
Conference_Location :
Montreal, Quebec, Canada
Print_ISBN :
0-8186-2150-8
DOI :
10.1109/FTCS.1991.146664