DocumentCode
1857681
Title
Distributed recovery block based fault-tolerant routing in hypercube networks
Author
Khan, Gul N. ; Hura, Gurdeep S. ; Wei, Gu
Author_Institution
Electr. & Comput. Eng., Ryerson Univ., Toronto, Ont., Canada
Volume
2
fYear
2002
fDate
2002
Firstpage
603
Abstract
This paper presents a fault-tolerant routing algorithm that employs a modified distributed recovery block (DRB) approach. The portion of a parallel or distributed system between the source and destination nodes is partitioned into a series of overlapping DRB groups. Each DRB group consists of three nodes: a current node and two successor nodes. The primary successor executes the primary try, while the alternate successor executes an alternate try. The primary successor node delivers the message, whereas the alternate successor stands ready to take over if the primary fails. The successful successor in an active DRB group becomes the current node of the next DRB group on the routing path. A prototype of the routing method is implemented for a hypercube topology, and its performance is compared with that of adaptive routing techniques based on backtracking.
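The chaining of DRB groups described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. Several details are assumptions made only for illustration: the function names `hamming_distance` and `drb_route`, the rule that both successors are neighbors one hop closer to the destination (picked in order of lowest differing address bit), the representation of faults as a known set of failed node addresses, and the decision to give up when both successors of a group have failed.

```python
from typing import List, Optional, Set


def hamming_distance(a: int, b: int) -> int:
    """Number of address bits in which two hypercube nodes differ."""
    return bin(a ^ b).count("1")


def drb_route(source: int, dest: int, dim: int,
              faulty: Set[int]) -> Optional[List[int]]:
    """Route from source to dest in a dim-dimensional hypercube.

    Each loop iteration models one DRB group: the current node plus a
    primary and an alternate successor. The successor selection rule is
    an assumption of this sketch, not taken from the paper.
    """
    if source in faulty or dest in faulty:
        return None
    path = [source]
    current = source
    while current != dest:
        diff = current ^ dest
        # Candidate successors: neighbors that are one hop closer to dest.
        closer = [current ^ (1 << d) for d in range(dim) if diff & (1 << d)]
        primary = closer[0]
        alternate = closer[1] if len(closer) > 1 else None
        if primary not in faulty:
            nxt = primary        # primary try succeeds
        elif alternate is not None and alternate not in faulty:
            nxt = alternate      # alternate takes over for the failed primary
        else:
            return None          # both tries failed; this sketch does not backtrack
        path.append(nxt)
        current = nxt            # successful successor leads the next DRB group
    return path
```

Under these assumptions, `drb_route(0b000, 0b111, 3, set())` yields the path `[0, 1, 3, 7]`, and marking node 1 as faulty yields `[0, 2, 3, 7]` because the alternate successor of the first group takes over. Every returned path has `hamming_distance(source, dest)` hops, since the sketch only moves toward the destination.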
Keywords
fault tolerant computing; hypercube networks; system recovery; distributed recovery block; fault-tolerant routing; primary successor node; Computer science; Drives; Fault tolerance; Fault tolerant systems; Hamming distance; Hypercubes; Intelligent networks; Ion beams; Network topology; Routing
fLanguage
English
Publisher
IEEE
Conference_Titel
Electrical and Computer Engineering, 2002. IEEE CCECE 2002. Canadian Conference on
ISSN
0840-7789
Print_ISBN
0-7803-7514-9
Type
conf
DOI
10.1109/CCECE.2002.1013010
Filename
1013010