DocumentCode :
1056183
Title :
Adaptive fault-tolerant deadlock-free routing in meshes and hypercubes
Author :
Su, Chien Chun ; Shin, Kang G.
Author_Institution :
Nantai Coll., Tainan, Taiwan
Volume :
45
Issue :
6
fYear :
1996
fDate :
6/1/1996 12:00:00 AM
Firstpage :
666
Lastpage :
683
Abstract :
We present an adaptive deadlock-free routing algorithm which decomposes a given network into two virtual interconnection networks, VIN1 and VIN2. VIN1 supports deterministic deadlock-free routing, and VIN2 supports fully-adaptive routing. Whenever a channel in VIN1 or VIN2 is available, it can be used to route a message. Each node is identified to be in one of three states: safe, unsafe, and faulty. The unsafe state is used for deadlock-free routing, and an unsafe node can still send and receive messages. When nodes become faulty/unsafe, some channels in VIN2 around the faulty/unsafe nodes are used as the detours of those channels in VIN1 passing through the faulty/unsafe nodes, i.e., the adaptability in VIN 2 is transformed to support fault-tolerant deadlock-free routing. Using information on the state of each node´s neighbors, we have developed an adaptive fault-tolerant deadlock-free routing scheme for n-dimensional meshes and hypercubes with only two virtual channels per physical link. In an n-dimensional hypercube, any pattern of faulty nodes can be tolerated as long as the number of faulty nodes is no more than [n/2]. The maximum number of faulty nodes that can be tolerated is 2n-1, which occurs when all faulty nodes can be encompassed in an (n-1)-cube. In an n-dimensional mesh, we use a more general fault model, called a disconnected rectangular block. Any arbitrary pattern of faulty nodes can be modeled as a rectangular block after finding both unsafe and disabled nodes (which are then treated as faulty nodes). This concept can also be applied to k-ary n-cubes with four virtual channels, two in VIN1 and the other two in VIN2. Finally, we present simulation results for both hypercubes and 2-dimensional meshes by using various workloads and fault patterns
Keywords :
concurrency control; fault tolerant computing; hypercube networks; network routing; adaptive fault-tolerant deadlock-free routing; deterministic deadlock-free routing; fault patterns; fully-adaptive routing; hypercubes; meshes; n-dimensional meshes; virtual channels; virtual interconnection networks; Fault diagnosis; Fault tolerance; Fellows; Hypercubes; Intelligent networks; Multiprocessor interconnection networks; Routing; Switching circuits; System recovery; Throughput;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.506423
Filename :
506423
Link To Document :
بازگشت