DocumentCode :
3120063
Title :
Storing and Discovering Critical Workflows from Log in Scientific Exploration
Author :
Shao, Qihong ; Kinsy, Michel ; Chen, Yi
Author_Institution :
Arizona State Univ., Tempe
fYear :
2007
fDate :
9-13 July 2007
Firstpage :
209
Lastpage :
212
Abstract :
This paper aims at effectively reproducing the results of previous scientific workflow executions. We first identify the information that needs to be recorded in a workflow execution log, based on which we then determine the data flow among experiments. To effectively record a workflow execution log in a relational database management system (RDBMS) when the workflow design is absent, we propose a generic relational storage schema. Then techniques have been designed to automatically discover the minimal set of experiments that must be performed in order to reproduce a scientific result by posing appropriate SQL queries. Although such SQL queries can be evaluated using an off-the-shelf database system, we investigate the unique characteristics of the workflow log data and optimization techniques for evaluating such SQL queries efficiently.
Keywords :
SQL; natural sciences computing; query processing; relational databases; SQL query; off-the-shelf database system; optimization technique; relational database management system; relational storage schema; scientific workflow log data execution; Astronomy; Bioinformatics; Chemicals; Database systems; Physics; Relational databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Services, 2007 IEEE Congress on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-0-7695-2926-4
Type :
conf
DOI :
10.1109/SERVICES.2007.61
Filename :
4278799
Link To Document :
بازگشت