• DocumentCode
    2756460
  • Title

    Distributed probabilistic fault diagnosis for multiprocessor systems

  • Author

    Berman, P. ; Pelc, A.

  • Author_Institution
    Dept. of Comput. Sci., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    1990
  • fDate
    26-28 June 1990
  • Firstpage
    340
  • Lastpage
    346
  • Abstract
    A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n to infinity is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n to infinity . By combining the probabilistic and the distributed approach the authors´ model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.<>
  • Keywords
    distributed processing; fault tolerant computing; multiprocessing systems; distributed probabilistic fault diagnosis; fault-free central monitoring unit; multiprocessor systems; probability of correctness; Computer science; Contracts; Councils; Fault detection; Fault diagnosis; Hypercubes; Multiprocessing systems; Nose; System testing; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1990. FTCS-20. Digest of Papers., 20th International Symposium
  • Conference_Location
    Newcastle Upon Tyne, UK
  • Print_ISBN
    0-8186-2051-X
  • Type

    conf

  • DOI
    10.1109/FTCS.1990.89383
  • Filename
    89383