• DocumentCode
    2753056
  • Title
    Identifying the cause of detected errors
  • Author
    Walter, C.J.
  • Author_Institution
    Allied-Signal Aerosp. Co., Columbia, MD, USA
  • fYear
    1990
  • fDate
    26-28 June 1990
  • Firstpage
    48
  • Lastpage
    55
  • Abstract
    The author presents an approach to the consistent diagnosis of error monitoring observations in a distributed fault-tolerant computing system, even when the faulty source produces arbitrary errors. He describes the online algorithm used in the multicomputer architecture for fault tolerance (MAFT) to diagnose faulty system elements. By the use of syndrome information which categorizes detected errors as either symmetric or asymmetric, bounds for correct diagnosis can be deduced. Finally, an interactive consistency algorithm is employed to guarantee consistent diagnosis in a distributed environment and to provide online verification of all diagnostic units.
  • Keywords
    computer architecture; distributed processing; fault tolerant computing; arbitrary errors; consistent diagnosis; diagnostic units; distributed fault-tolerant computing system; error monitoring observations; faulty source; interactive consistency algorithm; multicomputer architecture for fault tolerance; online algorithm; online verification; Aerodynamics; Distributed computing; Fault detection; Fault diagnosis; Fault tolerant systems; Hardware; Monitoring; Redundancy; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fault-Tolerant Computing, 1990. FTCS-20. Digest of Papers., 20th International Symposium
  • Conference_Location
    Newcastle Upon Tyne, UK
  • Print_ISBN
    0-8186-2051-X
  • Type
    conf
  • DOI
    10.1109/FTCS.1990.89365
  • Filename
    89365