DocumentCode
3048056
Title
Distributed on-line diagnosis in the presence of arbitrary faults
Author
Buskens, Richard W. ; Bianchini, Ronald P., Jr.
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
1993
fDate
22-24 June 1993
Firstpage
470
Lastpage
479
Abstract
This paper introduces a new fault model for system-level diagnosis and a class of online distributed diagnosis algorithms that operate correctly in the presence of fault nodes that disseminate arbitrarily corrupted diagnostic information. The fault model addresses the practical issue of designing an internode test to cover diagnosis algorithm operation. Since an explicit test to detect arbitrary failures is not practical, evidence of a node´s faulty behavior is provided by examining diagnositic messages exchanged by the node. In many practical systems, algorithm overhead using the new fault model is only twice that required for algorithms using the PMC fault model. The key results include a description of the new fault model, the specification of a class of online distributed diagnosis algorithms that use this fault model, and proofs of their correctness.
Keywords
distributed algorithms; PMC fault model; arbitrary faults; diagnositic messages; distributed online diagnosis; fault model; faulty behavior; internode test; online distributed diagnosis algorithms; system-level diagnosis; Adaptive systems; Algorithm design and analysis; Fault detection; Fault diagnosis; Message passing; Network address translation; Protocols; Robustness; Steady-state; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on
Conference_Location
Toulouse, France
ISSN
0731-3071
Print_ISBN
0-8186-3680-7
Type
conf
DOI
10.1109/FTCS.1993.627350
Filename
627350
Link To Document