Title :
Storing and Discovering Critical Workflows from Log in Scientific Exploration
Author :
Shao, Qihong ; Kinsy, Michel ; Chen, Yi
Author_Institution :
Arizona State Univ., Tempe
Abstract :
This paper aims at effectively reproducing the results of previous scientific workflow executions. We first identify the information that needs to be recorded in a workflow execution log, based on which we then determine the data flow among experiments. To effectively record a workflow execution log in a relational database management system (RDBMS) when the workflow design is absent, we propose a generic relational storage schema. Then techniques have been designed to automatically discover the minimal set of experiments that must be performed in order to reproduce a scientific result by posing appropriate SQL queries. Although such SQL queries can be evaluated using an off-the-shelf database system, we investigate the unique characteristics of the workflow log data and optimization techniques for evaluating such SQL queries efficiently.
Keywords :
SQL; natural sciences computing; query processing; relational databases; SQL query; off-the-shelf database system; optimization technique; relational database management system; relational storage schema; scientific workflow log data execution; Astronomy; Bioinformatics; Chemicals; Database systems; Physics; Relational databases;
Conference_Titel :
Services, 2007 IEEE Congress on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-0-7695-2926-4
DOI :
10.1109/SERVICES.2007.61