Title :
Performance of fault-tolerant diagnostics in the hypercube systems
Author :
Ghafoor, Arif ; Solé, Patrick
Author_Institution :
Dept. of Electr. & Comput. Eng., Syracuse Univ., NY, USA
fDate :
8/1/1989 12:00:00 AM
Abstract :
The concept of fault-tolerant self-diagnostics is introduced for distributed systems, and it is shown that there exists a performance tradeoff between the complexity of a self-diagnostic algorithm and the level of fault tolerance inherited by the algorithm. Hypercube systems are selected, and it is shown that designing an optimal algorithm for such systems has an equivalent coding theory formulation which belongs to the case of NP-hard problems. An efficient diagnostic scheme is proposed for these systems, and the performance tradeoff of the proposed algorithm, which is based on a combinatorial structure called the Hadamard matrix, is studied. The tradeoff between the fault tolerance and traffic complexity of the proposed diagnostic algorithm for hypercubes of small size is evaluated. An interesting compromise is exhibited for the hypercube with an arbitrary size
Keywords :
fault tolerant computing; multiprocessing systems; performance evaluation; Hadamard matrix; NP-hard problems; coding theory; combinatorial structure; complexity; diagnostic scheme; distributed systems; fault-tolerant diagnostics; hypercube systems; optimal algorithm; performance evaluation; self-diagnostic algorithm; Algorithm design and analysis; Automatic testing; Codes; Costs; Fault tolerance; Fault tolerant systems; Hypercubes; Multiprocessor interconnection networks; Partitioning algorithms; Software testing;
Journal_Title :
Computers, IEEE Transactions on