Title :
ReGS: user-level reliability in a grid environment
Author :
Sanches, José Afonso Lajas ; Vargas, Patrícia Kayser ; de Castro Dutra, I. ; Costa, Vítor Santos ; Geyer, Claudio F. R.
Author_Institution :
COPPE, Univ. Fed. do Rio de Janeiro, Brazil
Abstract :
Grid environments are ideal for executing applications that require a huge amount of computational work, both because of the large number of tasks to execute and because of the large amount of data to be analysed. Unfortunately, current tools may require users to deal with corrupted outputs or early termination of tasks themselves. This becomes impractical as the number of parallel runs grows, easily exceeding the thousands. ReGS is a user-level software tool designed to automatically detect and restart corrupted or prematurely terminated tasks. ReGS uses a Web interface for setting up and controlling grid execution, and provides automatic input data setup. ReGS also detects job dependencies automatically, through the GRID-ADL task management language. Our results show that, besides automatically and effectively managing a huge number of tasks in grid environments, ReGS is also a good monitoring tool for spotting problems in grid nodes.
Keywords :
Internet; grid computing; software reliability; specification languages; user interfaces; GRID-ADL task management language; Web interface; grid environment; reliable grid submission; user-level reliability; user-level software; Application software; Automatic control; Computerized monitoring; Environmental management; Machine learning; Physics; Prototypes; Resource management; Software design; Software prototyping;
Conference_Title :
Cluster Computing and the Grid, 2005. CCGrid 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9074-1
DOI :
10.1109/CCGRID.2005.1558634