DocumentCode
3163321
Title
Fault-tolerant meshes with minimal numbers of spares
Author
Bruck, Jehoshua ; Cypher, Robert ; Ho, Ching-Tien
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
fYear
1991
fDate
2-5 Dec 1991
Firstpage
288
Lastpage
295
Abstract
Presents several techniques for adding fault-tolerance to distributed memory parallel computers. More formally, given a target graph with n nodes, the authors create as fault-tolerant graph with n +k nodes such that given any set of k or fewer faulty nodes, the remaining graph is guaranteed to contain the target graph as a fault-free subgraph. As a result, any algorithm designed for the target graph will run with no slowdown in the presence of k or fewer node faults, regardless of their distribution. The authors present fault-tolerant graphs for target graphs which are 2-dimensional meshes, tori, eight-connected meshes and hexagonal meshes. In all cases these fault-tolerant graphs have smaller degree than any previously known graphs with the same properties
Keywords
distributed memory systems; fault tolerant computing; parallel architectures; distributed memory; fault-tolerance; fault-tolerant graphs; fault-tolerant meshes; parallel computers; target graph; Algorithm design and analysis; Costs; Fabrication; Fault tolerance; Large scale integration; Microprocessors; Parallel machines; Redundancy; Switches; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing, 1991. Proceedings of the Third IEEE Symposium on
Conference_Location
Dallas, TX
Print_ISBN
0-8186-2310-1
Type
conf
DOI
10.1109/SPDP.1991.218267
Filename
218267
Link To Document