DocumentCode
2177042
Title
Modeling the coverage and effectiveness of fault-management architectures in layered distributed systems
Author
Das, Olivia ; Woodside, C. Murray
Author_Institution
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
fYear
2002
fDate
2002
Firstpage
745
Lastpage
754
Abstract
Increasingly, fault-tolerant distributed software applications use a separate architecture for failure detection instead of coding the mechanisms inside the application itself. Such a structure removes the intricacies of the failure detection mechanisms from the application, and avoids repeating them in every program. However, successful system reconfiguration now depends on the management architecture (which does both fault detection and reconfiguration), and on management subsystem failures, as well as on the application. This paper presents an approach which computes the architecture-based system reconfiguration coverage simultaneously with its performability.
Keywords
distributed processing; software fault tolerance; architecture-based system reconfiguration coverage; fault-management architectures; fault-tolerant distributed software; layered distributed systems; system reconfiguration; Algorithm design and analysis; Application software; Computer architecture; Distributed computing; Failure analysis; Fault detection; Fault tolerance; Fault tolerant systems; Redundancy; Systems engineering and theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
Print_ISBN
0-7695-1101-5
Type
conf
DOI
10.1109/DSN.2002.1029020
Filename
1029020
Link To Document