• DocumentCode
    815382
  • Title

    Intermittent fault diagnosis in multiprocessor systems

  • Author

    Blough, Douglas M. ; Sullivan, Gregory F. ; Masson, Gerald M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
  • Volume
    41
  • Issue
    11
  • fYear
    1992
  • fDate
    11/1/1992 12:00:00 AM
  • Firstpage
    1430
  • Lastpage
    1441
  • Abstract
    The authors present and analyze a probabilistic model for the self-diagnosis capabilities of a multiprocessor system. In this model an individual processor fails with probability p and a nonfaulty processor testing a faulty processor detects a fault with probability q. This models the situation where processors can be intermittently faulty or the situation where tests are not capable of detecting all possible faults within a processor. An efficient algorithm that can achieve correct diagnosis with high probability in systems of O(n log n) connections, where n is the number of processors, is presented. It is the first algorithm to be able to diagnose a large number of intermittently faulty processors in a class of systems that includes hypercubes. It is shown that, under this model, no algorithm can achieve correct diagnosis with high probability in regular systems which conduct a number of tests dominated by n log n. Examples of systems which perform a modest number of tests are given in which the probability of correct diagnosis for the algorithm is very nearly one
  • Keywords
    fault tolerant computing; multiprocessing systems; hypercubes; intermittent fault diagnosis; multiprocessor systems; nonfaulty processor; probabilistic model; self-diagnosis; Computer networks; Concrete; Concurrent computing; Fault detection; Fault diagnosis; Hardware; Hypercubes; Multiprocessing systems; Performance evaluation; System testing;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.177313
  • Filename
    177313