• DocumentCode
    2177042
  • Title
    Modeling the coverage and effectiveness of fault-management architectures in layered distributed systems
  • Author
    Das, Olivia ; Woodside, C. Murray
  • Author_Institution
    Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    745
  • Lastpage
    754
  • Abstract
    Increasingly, fault-tolerant distributed software applications use a separate architecture for failure detection instead of coding the mechanisms inside the application itself. Such a structure removes the intricacies of the failure detection mechanisms from the application, and avoids repeating them in every program. However, successful system reconfiguration now depends on the management architecture (which does both fault detection and reconfiguration), and on management subsystem failures, as well as on the application. This paper presents an approach which computes the architecture-based system reconfiguration coverage simultaneously with its performability.
  • Keywords
    distributed processing; software fault tolerance; architecture-based system reconfiguration coverage; fault-management architectures; fault-tolerant distributed software; layered distributed systems; system reconfiguration; Algorithm design and analysis; Application software; Computer architecture; Distributed computing; Failure analysis; Fault detection; Fault tolerance; Fault tolerant systems; Redundancy; Systems engineering and theory
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
  • Print_ISBN
    0-7695-1101-5
  • Type
    conf
  • DOI
    10.1109/DSN.2002.1029020
  • Filename
    1029020