Title :
Immucube: Scalable Fault-Tolerant Routing for k-ary n-cube Networks
Author :
Puente, Valentin ; Gregorio, Jose Ángel
Author_Institution :
ETSIIT, Cantabria Univ., Santander
fDate :
6/1/2007 12:00:00 AM
Abstract :
This work presents Immucube, a scalable and efficient mechanism to improve dependability of interconnection networks for parallel and distributed computers. Immucube achieves better flexibility and scalability than any other previous fault-tolerant mechanism in k-ary n-cubes. The proposal inherits from Immunet several advantages over other previous fault-tolerant routing algorithms: 1) allowing any temporal and spatial fault combination, 2) permitting automatic and application-transparent reconfiguration after any fault, and 3) requiring a negligible overhead in the absence of faults. Immucube introduces new important features, such as: 4) providing graceful performance degradation, even in very large interconnection networks, 5) tolerating transparent resource utilization after transitory faults or partial repair of faulty resources, 6) being able to deal with intermittent faults, and 7) being able to dynamically recover the original network performance when all the failed components have been repaired
Keywords :
fault tolerance; multiprocessor interconnection networks; network routing; parallel processing; application-transparent reconfiguration; immucube; interconnection networks; k-ary n-cube networks; parallel distributed computers; scalable fault-tolerant routing; transparent resource utilization; Computer networks; Concurrent computing; Degradation; Distributed computing; Fault tolerance; Multiprocessor interconnection networks; Proposals; Resource management; Routing; Scalability; Interconnection networks; fault-tolerant routing; k{hbox{-}}aryn{hbox{-}}cubes.;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2007.1047