Title :
Almost certain diagnosis for intermittently faulty systems
Author :
Blough, D.M. ; Sullivan, G.F. ; Masson, G.M.
Author_Institution :
Dept. of Comput. Sci., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
The authors present and analyze a uniformly probabilistic model for the self-diagnosis capabilities of a multiprocessor system. In this model an individual processor fails with probability p and a fault-free processor testing a faulty processor detects a fault with probability q, modeling the situation in which processors can be intermittently faulty or the situation where tests are not capable of detecting all possible faults within a processor. They present an efficient algorithm which utilizes a relatively small number of tests (given by any function dominating n log n where n is the number of processors) and achieves correct diagnosis with high probability. They obtain a nearly matching lower bound which shows that no algorithm can achieve correct diagnosis with high probability in systems which conduct a number of tests dominated by n log n. Examples of systems which perform a modest number of tests are given in which the probability of correct diagnosis for the authors´ algorithm is very nearly one.<>
Keywords :
fault tolerant computing; multiprocessing systems; almost certain diagnosis; intermittently faulty systems; multiprocessor system; nearly matching lower bound; uniformly probabilistic model; Computer science; Fault detection; Fault diagnosis; Multiprocessing systems; Performance analysis; Performance evaluation; System testing;
Conference_Titel :
Fault-Tolerant Computing, 1988. FTCS-18, Digest of Papers., Eighteenth International Symposium on
Conference_Location :
Tokyo, Japan
Print_ISBN :
0-8186-0867-6
DOI :
10.1109/FTCS.1988.5329