Title :
Three types of fault coverage in multi-state systems
Author :
Levitin, Gregory ; Amari, Suprasad
Author_Institution :
Israel Electr. Corp. Ltd., Haifa, Israel
Abstract :
Fault-tolerance is an essential architectural attribute for achieving high reliability in many critical applications of digital systems. Automatic fault and error handling mechanisms play a crucial role in implementing fault tolerance because an uncovered (undetected) fault may lead to a system or a subsystem failure even when adequate redundancy exists. Examples of this effect can be found in computing systems, electrical power distribution networks, pipelines carrying dangerous materials etc. Because an uncovered fault may lead to overall system failure, an excessive level of redundancy may even reduce the system reliability. We consider three types of coverage models: 1. element level coverage where the fault coverage probability of an element does not depend on the states of other elements; 2. the multi-fault coverage where the effectiveness of recovery mechanisms depends on the coexistence of multiple faults in a group of elements that collectively participate in detecting and recovering the faults in that group; 3. the performance dependent coverage where the effectiveness of recovery mechanisms in a group depends on the entire performance level of this group. The paper presents a modification of the generalized reliability block diagram (RBD) method for evaluating reliability and performance indices of complex multi-state series-parallel systems with all these types of fault coverage. The suggested method based on a universal generating function technique allows the system performance distribution to be obtained using a straightforward recursive procedure.
Keywords :
fault tolerance; power distribution faults; power engineering computing; power system reliability; electrical power distribution; element level coverage; fault-tolerance; multifault coverage; multistate series-parallel system; multistate systems; performance dependent coverage; reliability block diagram; system reliability; Computer networks; Digital systems; Distributed computing; Fault detection; Fault tolerant systems; Pipelines; Power system modeling; Power systems; Redundancy; Reliability; imperfect fault coverage; multi-fault coverage; multi-state system; reliability; universal generating function;
Conference_Titel :
Reliability, Maintainability and Safety, 2009. ICRMS 2009. 8th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-4903-3
Electronic_ISBN :
978-1-4244-4905-7
DOI :
10.1109/ICRMS.2009.5270224