DocumentCode :
680232
Title :
Automatic capture of provenance data in genome project workflows
Author :
Pinheiro, Rodrigo ; Holanda, Maristela ; Araujo, Aleteia P. F. ; Walter, Maria Emilia ; Lifschitz, Sergio
Author_Institution :
Comput. Sci. Dept., Univ. of Brasilia, Brasilia, Brazil
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
15
Lastpage :
20
Abstract :
Many scientific experiments are designed as computational workflows in the bioinformatics domain, which facilitates implementation and analysis. However, the amount of data generated increases at every phase of each execution, hindering the identification of the source and the data transformation. Therefore, it has become necessary to create new tools to verify automatically which resources and parameters were used to generate the results, among other information to validate and publish the experiment. This functionality of automatically capturing data provenance has been receiving attention in the scientific community, primarily with regard to bioinformatics projects, due the fact that the same workflow is executed several times with different parameters and versions of the tools. In this paper, we propose to use relational schema to automatically store data provenance using the PROV-DM model for workflows in bioinformatics projects.
Keywords :
bioinformatics; data analysis; genomics; PROV-DM model; automatic capture; bioinformatic domain; computational workflows; data generation; data provenance; data transformation; genome project workflows; Bioinformatics; Biological system modeling; Data models; Databases; Genomics; XML; PROV-DM; bioinformatics; data provenace; genome projects; insert; workflow;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732621
Filename :
6732621
Link To Document :
بازگشت