DocumentCode :
3590439
Title :
A Fault Tolerant Workflow for Reproducible Research
Author :
Rodriguez-Pascual, Manuel ; Rubio-Montero, Antonio Juan ; Mayo-Garcia, Rafael ; Kanellopoulos, Christos ; Prnjat, Ognjen ; Darriba, Diego ; Posada, David
Author_Institution :
CIEMAT, Madrid, Spain
fYear :
2014
Firstpage :
70
Lastpage :
75
Abstract :
In this work, the authors present a set of tools to overcome the problem of creating and executing distributed applications on dynamic environments in a resilient way, also ensuring the reproducibility of the performed experiments. The objective is to provide a portable, unattended and fault-tolerant set of tools, encapsulating the infrastructure-dependent operations away from the application developers and users, allowing to perform experiments based on open access data repositories. In this way, users can seamlessly search and lately access datasets that can be automatically retrieved as input data into a code already integrated in the proposed workflow. Such a search is based on metadata standards and relies on Persistent Identifiers (PID) to assign specific repositories. The applications profit from Distributed Toolbox, a newly created framework devoted to the creation and execution of distributed applications and includes tools for unattended Cluster and Grid execution, where a total fault tolerance is provided. By decoupling the definition of the remote tasks from its execution and control, the development, execution and maintenance of distributed applications is significantly simplified with respect to previous solutions, increasing their robustness and allowing running them on different computational platforms with little effort. The integration with open access databases and employment of PIDs for long-lasting references ensures that the data related to the experiments will persist, closing a complete research circle of data access / processing/ storage / dissemination of results.
Keywords :
distributed processing; software fault tolerance; PID; access datasets; distributed applications; distributed toolbox; dynamic environments; fault tolerant workflow; grid execution; infrastructure dependent operations; metadata standards; open access data repositories; persistent identifiers; reproducible research; Databases; Electronic mail; Fault tolerance; Fault tolerant systems; Open Access; Software; Cluster; dynamic; fault tolerant; grid; workflow reproducibility;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Computer Technology (GOCICT), 2014 Annual Global Online Conference on
Type :
conf
DOI :
10.1109/GOCICT.2014.10
Filename :
7113668
Link To Document :
بازگشت