Title :
Dynamic data reconfiguration for SPMD programs in faulty multicomputers
Author :
Angelaccio, M. ; Colajanni, M. ; Grassi, V.
Author_Institution :
Dipartimento di Ingegneria Elettronica, Rome Univ., Italy
Abstract :
The single-program multiple-data (SPMD) paradigm is becoming the most diffuse way to program commercial multicomputers. In this paper we demonstrate that for a wide class of SPMD algorithms it is possible to achieve an efficient fault tolerance avoiding hardware redundancy. We propose a software approach that aims to reconfigure data, thus achieving a good slowdown in computation owing to the fine granularity of the workload to redistribute. In particular, we present and compare three data reconfiguration strategies applied to a problem model that includes a wide class of SPMD iterative algorithms characterized by nonlocal communications among the nodes. The result is that in most of the cases it is better to introduce some communication overhead than to leave idle a few healthy processors
Keywords :
fault tolerant computing; multiprocessing systems; SPMD programs; communication overhead; dynamic data reconfiguration; fault tolerance; faulty multicomputers; granularity; hardware redundancy; nonlocal communications; single-program multiple-data paradigm; software approach; Degradation; Dynamic programming; Fault tolerance; Fault tolerant systems; Hardware; Hypercubes; Iterative algorithms; Jacobian matrices; Parallel processing; Redundancy;
Conference_Titel :
Fault-Tolerant Parallel and Distributed Systems, 1994., Proceedings of IEEE Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
0-8186-6807-5
DOI :
10.1109/FTPDS.1994.494486