Title :
Distributed on-line diagnosis in the presence of arbitrary faults
Author :
Buskens, Richard W. ; Bianchini, Ronald P., Jr.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
This paper introduces a new fault model for system-level diagnosis and a class of online distributed diagnosis algorithms that operate correctly in the presence of faulty nodes that disseminate arbitrarily corrupted diagnostic information. The fault model addresses the practical issue of designing an internode test to cover diagnosis algorithm operation. Since an explicit test to detect arbitrary failures is not practical, evidence of a node's faulty behavior is provided by examining diagnostic messages exchanged by the node. In many practical systems, algorithm overhead using the new fault model is only twice that required for algorithms using the PMC fault model. The key results include a description of the new fault model, the specification of a class of online distributed diagnosis algorithms that use this fault model, and proofs of their correctness.
Keywords :
distributed algorithms; PMC fault model; arbitrary faults; diagnostic messages; distributed online diagnosis; fault model; faulty behavior; internode test; online distributed diagnosis algorithms; system-level diagnosis; Adaptive systems; Algorithm design and analysis; Fault detection; Fault diagnosis; Message passing; Network address translation; Protocols; Robustness; Steady-state; System testing
Conference_Title :
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Conference_Location :
Toulouse, France
Print_ISBN :
0-8186-3680-7
DOI :
10.1109/FTCS.1993.627350