Title :
Storing and Querying Scientific Workflow Provenance Metadata Using an RDBMS
Author :
Chebotko, Artem ; Fei, Xubo ; Lin, Cui ; Lu, Shiyong ; Fotouhi, Farshad
Author_Institution :
Wayne State Univ., Detroit
Abstract :
Provenance management has become increasingly important to support scientific discovery reproducibility, result interpretation, and problem diagnosis in scientific workflow environments. This paper proposes an approach to provenance management that seamlessly integrates the interoperability, extensibility, and reasoning advantages of semantic Web technologies with the storage and querying power of an RDBMS. Specifically, we propose: i) two schema mapping algorithms to map an arbitrary OWL provenance ontology to a relational database schema that is optimized for common provenance queries; ii) two efficient data mapping algorithms to map provenance RDF metadata to relational data according to the generated relational database schema, and iii) a schema-independent SPARQL-to-SQL translation algorithm that is optimized on-the-fly by using the type information of an instance available from the input provenance ontology and the statistics of the sizes of the tables in the database. Experimental results are presented to show that our algorithms are efficient and scalable.
Keywords :
meta data; relational databases; semantic Web; data mapping algorithms; provenance management; provenance ontology; relational database schema; scientific workflow provenance metadata; semantic Web technologies; Energy management; Environmental management; OWL; Ontologies; Relational databases; Reproducibility of results; Resource description framework; Semantic Web; Statistics; Technology management;
Conference_Titel :
e-Science and Grid Computing, IEEE International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-0-7695-3064-2
DOI :
10.1109/E-SCIENCE.2007.70