DocumentCode
1685695
Title
System-diagnosis of cluster-based parallel architectures
Author
Benkahla, O. ; Aktouf, C. ; Robach, C.
Author_Institution
Lab. de Genie Inf., IMAG, Grenoble, France
fYear
1996
Firstpage
305
Lastpage
309
Abstract
The paper explores the diagnosis of cluster based parallel architectures. A hierarchical strategy which is well suited to such architectures is proposed. This strategy avoids a costly full distributed diagnosis of the network by running an adaptive diagnosis algorithm into each cluster and collecting all the test results at the host level. Key results of the paper include realistic fault and architecture models, an adaptive cluster diagnosis algorithm and a global diagnosis strategy of cluster based parallel machines
Keywords
fault diagnosis; parallel architectures; parallel machines; performance evaluation; adaptive cluster diagnosis algorithm; adaptive diagnosis algorithm; cluster based parallel architectures; cluster based parallel machines; global diagnosis strategy; hierarchical strategy; host level; system diagnosis; test results; Automatic testing; Clustering algorithms; Communication system control; Computer architecture; Concurrent computing; Fault diagnosis; Parallel architectures; Parallel machines; Partitioning algorithms; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing, 1996. PDP '96. Proceedings of the Fourth Euromicro Workshop on
Conference_Location
Braga
Print_ISBN
0-8186-7376-1
Type
conf
DOI
10.1109/EMPDP.1996.500601
Filename
500601
Link To Document