• DocumentCode
    964305
  • Title

    Fault-tolerant meshes and hypercubes with minimal numbers of spares

  • Author

    Bruck, Jehoshua ; Cypher, Robert ; Ho, Ching-Tien

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    42
  • Issue
    9
  • fYear
    1993
  • fDate
    9/1/1993 12:00:00 AM
  • Firstpage
    1089
  • Lastpage
    1104
  • Abstract
    This paper presents several techniques for tolerating faults in d-dimensional mesh and hypercube architectures. The approach consists of adding spare processors and communication links so that the resulting architecture will contain a fault-free mesh or hypercube in the presence of faults. The authors optimize the cost of the fault-tolerant architecture by adding exactly k spare processors (while tolerating up to k processor and/or link faults) and minimizing the maximum number of links per processor. For example, when the desired architecture is a d-dimensional mesh and k=1, they present a fault-tolerant architecture that has the same maximum degree as the desired architecture (namely, 2d) and has only one spare processor. They also present efficient layouts for fault-tolerant two- and three-dimensional meshes, and show how multiplexers and buses can be used to reduce the degree of fault-tolerant architectures. Finally, they give constructions for fault-tolerant tori, eight-connected meshes, and hexagonal meshes
  • Keywords
    fault tolerant computing; hypercube networks; performance evaluation; buses; d-dimensional mesh; fault-tolerant architecture; fault-tolerant meshes; hexagonal meshes; hypercubes; multiplexers; tori; Computer architecture; Concurrent computing; Cost function; Fault tolerance; Global communication; Hypercubes; Multiplexing; Parallel machines; Switches; Topology;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.241598
  • Filename
    241598