DocumentCode
3323469
Title
Querying and Managing Provenance through User Views in Scientific Workflows
Author
Biton, Olivier ; Cohen-Boulakia, Sarah ; Davidson, Susan B. ; Hara, Carmem S.
Author_Institution
Univ. of Pennsylvania, Philadelphia, PA
fYear
2008
fDate
7-12 April 2008
Firstpage
1072
Lastpage
1081
Abstract
Workflow systems have become increasingly popular for managing experiments where many bioinformatics tasks are chained together. Due to the large amount of data generated by these experiments and the need for reproducible results, provenance has become of paramount importance. Workflow systems are therefore starting to provide support for querying provenance. However, the amount of provenance information may be overwhelming, so there is a need for abstraction mechanisms to help users focus on the most relevant information. The technique we pursue is that of "user views". Since bioinformatics tasks may themselves be complex sub-workflows, a user view determines what level of sub-workflow the user can see, and thus what data and tasks are visible in provenance queries. In this paper, we formalize the notion of user views, demonstrate how they can be used in provenance queries, and give an algorithm for generating a user view based on which tasks are relevant for the user. We then describe our prototype and give performance results. Although presented in the context of scientific workflows, the technique applies to other data-oriented workflows.
Keywords
biology computing; query processing; scientific information systems; workflow management software; abstraction mechanisms; bioinformatics; provenance management; provenance querying; scientific workflow systems; user views; Bioinformatics; Buildings; Databases; Intrusion detection; Large-scale systems; Phylogeny; Proteins; Prototypes; Sequences; Workflow management software
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497516
Filename
4497516