Title :
An effective approach for achieving fault tolerance in hypercubes
Author :
Al-Tawil, Khalid M. ; Avresky, Dimiter R.
Author_Institution :
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
Abstract :
The hypercube network is an attractive structure for parallel processing because of its regularity. The problem of tolerating faulty processors in hypercubes has been studied by many researchers, either by using spares or by reconfiguration. In this paper, we present algorithms for achieving fault tolerance in hypercubes using spanning trees, without requiring additional spare nodes. We present two algorithms; one uses completely unbalanced spanning trees (CUST) and the other uses balanced spanning trees (BST). Both algorithms use, at most, one used link and one unused link for every reconstructed path in the reconfigured hypercube. The algorithms are optimal, in terms of the reconfiguration time and may increase the congestion of a link by, at most, one with no extra-dilation. Single-fault coverage of 100% and almost 100% fault coverage of double and triple faults are achieved by the proposed algorithms for hypercubes having a dimension of n⩾10
Keywords :
fault tolerant computing; hypercube networks; parallel processing; balanced spanning trees; completely unbalanced spanning trees; fault tolerance; faulty processors; hypercubes; parallel processing; reconfiguration; spanning trees; Binary search trees; Computer science; Fault tolerance; Hamming distance; Hypercubes; Intelligent networks; Joining processes; Parallel machines; Parallel processing;
Conference_Titel :
Fault-Tolerant Parallel and Distributed Systems, 1994., Proceedings of IEEE Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
0-8186-6807-5
DOI :
10.1109/FTPDS.1994.494482