Title :
Adaptive Algorithm-Based Fault Tolerance for Parallel Computing in Linear Systems
Author :
Khan, Javed I. ; Lin, W. ; Yun, D.Y.Y.
Abstract :
This paper presents a dynamically adaptive stabilization scheme for parallel matrix computation. The scheme performs automatic error detection and correction through inserting redundant, but concurrent tracer computations within the folds of the regular computation. It also eliminates the costly row interchange used in classical pivoting. A fault-tolerant double wavefront matrix algorithmfor a MIMD array multi-processor with toroidal inter connection has been designed to demonstrate the strength of the proposed scheme. This algorithm can compute: i) matrix inverse ii) solution vector to the linear system and Hi) predetermined linear combination of the solution vector from identical algorithmic framework. This efficient tri-solution algorithm excels most other known methods in parallel performance.
Conference_Titel :
Parallel Processing, 1994. ICPP 1994 Volume 3. International Conference on
Conference_Location :
North Carolina, USA
Print_ISBN :
0-8493-2493-9
DOI :
10.1109/ICPP.1994.49