• DocumentCode
    2601089
  • Title

    Bayesian analysis for fault location in homogeneous distributed systems

  • Author

    Chang, Yu Lo Cyrus ; Lander, Leslie C. ; Lu, Horng-Shing ; Wells, Martin T.

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Binghamton, NY, USA
  • fYear
    1993
  • fDate
    6-8 Oct 1993
  • Firstpage
    44
  • Lastpage
    53
  • Abstract
    A simple and practical probabilistic comparison-based model, employing multiple incomplete test concepts, for handling fault location in distributed systems using a Bayesian analysis procedure is proposed. This approach is more practical and complete than previous ones since it does not assume any conditions such as permanently faulty units, complete tests, perfect environments, or non-malicious environments. Fault-free systems are handled without overhead; hence, the test procedure may be used to monitor a functioning system. Given a system S with a specific test graph, the corresponding conditional distribution between the comparison test results (syndrome) and the fault patterns of S can be generated. To avoid the complex global Bayesian estimation process, a simple bitwise Bayesian algorithm is developed for fault location in S, which locates system failures with linear complexity, suitable for hard real-time systems
  • Keywords
    Bayes methods; distributed processing; failure analysis; program testing; Bayesian analysis procedure; bitwise Bayesian algorithm; comparison test results; conditional distribution; fault location; fault patterns; functioning system; global Bayesian estimation process; hard real-time systems; homogeneous distributed systems; linear complexity; multiple incomplete test concepts; practical probabilistic comparison-based model; specific test graph; test procedure; Bayesian methods; Computational complexity; Computer science; Fault diagnosis; Fault location; Inference algorithms; Real time systems; System testing; Table lookup; Test pattern generators;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 1993. Proceedings., 12th Symposium on
  • Conference_Location
    Princeton, NJ
  • Print_ISBN
    0-8186-4310-2
  • Type

    conf

  • DOI
    10.1109/RELDIS.1993.393474
  • Filename
    393474