DocumentCode
1294978
Title
Dynamic fault reconfiguration in a mesh-connected MIMD environment
Author
Uyar, M. Ümit ; Reeves, Anthony P.
Author_Institution
AT&T Bell Lab., Holmdel, NJ, USA
Volume
37
Issue
10
fYear
1988
fDate
10/1/1988 12:00:00 AM
Firstpage
1191
Lastpage
1205
Abstract
The near-neighbor problem is characterized by many iterations of a parallel matrix operation in which each matrix element is recomputed as a function of itself and its immediately adjacent near neighbors. Several highly parallel computer systems have been designed with the near-neighbor class of problems as the target application. As the number of processors in evolving parallel computer systems increases, the capability of fault tolerance to processor failures becomes more important. The authors show how fault tolerance can be efficiently achieved on an MIMD (multiple-instruction, multiple-data-stream) computer system for the near-neighbor problem by task redistribution. The techniques discussed minimize the extra data transfers and/or the extra computation in the system with faulty processors and links
Keywords
fault tolerant computing; parallel processing; data transfers; dynamic default reconfiguration; fault tolerance; highly parallel computer systems; mesh-connected MIMD environment; near-neighbor problem; parallel matrix operation; processor failures; Application software; Concurrent computing; Costs; Fault tolerance; Fault tolerant systems; Hardware; Image processing; Large-scale systems; Parallel processing; Partial differential equations;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/12.5981
Filename
5981
Link To Document