DocumentCode :
2573708
Title :
Dependable parallel computing with agents based on a task graph model
Author :
Chabridon, Sophie ; Gelenbe, Erol
Author_Institution :
UFR de Math. et Inf., Univ. Rene Descartes, Paris, France
fYear :
1995
fDate :
25-27 Jan 1995
Firstpage :
350
Lastpage :
357
Abstract :
We discuss a novel technique for improving the dependability of parallel programs executing on a MIMD shared memory architecture. The idea is to empower certain tasks of each application program to carry out failure detection, and to reschedule the execution of those tasks which are considered to have failed. The technique we propose is based on a task graph representation of the parallel program, in which communications between tasks have been voluntarily isolated to the end of each task which is being considered. We propose and evaluate several algorithms which can detect failures and restart failed tasks. A discrete-event simulator is used to evaluate the performance under the effect of failures, with the use of our detection and restart algorithms, of a specific parallel application: the fast Fourier transform
Keywords :
discrete event simulation; parallel processing; parallel programming; software performance evaluation; MIMD shared memory architecture; agents; application program; dependable parallel computing; discrete-event simulator; failure detection; fast Fourier transform; parallel programs; performance evaluation; task graph model; Algorithm design and analysis; Computational modeling; Computer architecture; Concurrent computing; Discrete event simulation; Fast Fourier transforms; Hardware; Memory architecture; Parallel processing; Very large scale integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing, 1995. Proceedings. Euromicro Workshop on
Conference_Location :
San Remo
Print_ISBN :
0-8186-7031-2
Type :
conf
DOI :
10.1109/EMPDP.1995.389188
Filename :
389188
Link To Document :
بازگشت