DocumentCode :
2646349
Title :
System reliability analysis of an N-version programming application
Author :
Dugan, Joanne Bechta ; Lyu, Michael R.
Author_Institution :
Dept. of Electr. Eng., Virginia Univ., Charlottesville, VA, USA
fYear :
1993
fDate :
3-6 Nov 1993
Firstpage :
103
Lastpage :
111
Abstract :
This paper presents a quantitative reliability analysis of a system designed to tolerate both hardware and software faults. The system being studied achieves integrated fault tolerance by implementing N-version programming (NVP) on redundant hardware. The analysis of the system considers independent software faults, related software faults, transient hardware faults, permanent hardware faults, and imperfect coverage. The overall model is a Markov reward model in which the states of the Markov chain represent the long-term evolution of the structure of the system. For each operational configuration, a fault tree model captures the effects of software faults and transient hardware faults on the task computation. The fault tree models define the reward structure for the overall model. The software fault model is parameterized using experimental data associated with a recent implementation of an NVP system using the current design paradigm, in which the predictions of software failures are very close to the empirical data. The hardware model is parameterized by considering typical failure rates associated with hardware faults and coverage parameters. Results from our study show that it is important to consider both hardware and software faults in the reliability analysis of an NVP system, since these estimates increase with time. Moreover, the function for error detection and recovery is extremely important to fault-tolerant software
Keywords :
Markov processes; fault tolerant computing; reliability; software fault tolerance; system recovery; systems analysis; Markov chain; Markov reward model; N-version programming; NVP system; error detection; error recovery; failure rates; fault tree model; fault-tolerant software; imperfect coverage; integrated fault tolerance; permanent hardware faults; quantitative reliability analysis; redundant hardware; reliability analysis; reward structure; software failures; software faults; task computation; transient hardware faults; Application software; Fault detection; Fault tolerance; Fault tolerant systems; Fault trees; Hardware; Predictive models; Reliability; Software systems; Transient analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Reliability Engineering, 1993. Proceedings., Fourth International Symposium on
Conference_Location :
Denver, CO
Print_ISBN :
0-8186-4010-3
Type :
conf
DOI :
10.1109/ISSRE.1993.624279
Filename :
624279
Link To Document :
بازگشت