Title :
A general framework for developing adaptive fault-tolerant routing algorithms
Author :
El-Ghazawi, Tarek ; Youssef, Abdou
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., George Washington Univ., Washington, DC, USA
fDate :
6/1/1993 12:00:00 AM
Abstract :
It is shown that Cartesian product (CP) graph-based network methods provide a useful framework for the design of reliable parallel computer systems. Given component networks with prespecified connectivity, more complex networks with known connectivity and terminal reliability can be developed. CP networks provide systematic techniques for developing reliable fault-tolerant routing schemes, even for very complex topological structures. The authors establish the theoretical foundations that relate the connectivity of a CP network, the connectivity of the component networks, and the number of faulty components: present an adaptive generic algorithm that can perform successful point-to-point routing in the presence of faults: synthesize, using the theoretical results, this adaptive fault-tolerant algorithm from algorithms written for the component networks: prove the correctness of the algorithm: and show that the algorithm ensures following an optimal path, in the presence of many faults, with high probability
Keywords :
fault tolerant computing; genetic algorithms; parallel machines; software reliability; Cartesian product; adaptive generic algorithm; complex topological structures; connectivity; design; fault tolerant computing; graph-based network methods; parallel computer systems; point-to-point routing; probability; routing algorithms; software reliability; systematic techniques; Algorithm design and analysis; Fault tolerance; Graph theory; Hypercubes; Mathematics; Multiprocessor interconnection networks; Network synthesis; Performance analysis; Reliability theory; Routing;
Journal_Title :
Reliability, IEEE Transactions on