• DocumentCode
    3323469
  • Title

    Querying and Managing Provenance through User Views in Scientific Workflows

  • Author

    Biton, Olivier ; Cohen-Boulakia, Sarah ; Davidson, Susan B. ; Hara, Carmem S.

  • Author_Institution
    Univ. of Pennsylvania, Philadelphia, PA
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1072
  • Lastpage
    1081
  • Abstract
    Workflow systems have become increasingly popular for managing experiments where many bioinformatics tasks are chained together. Due to the large amount of data generated by these experiments and the need for reproducible results, provenance has become of paramount importance. Workflow systems are therefore starting to provide support for querying provenance. However, the amount of provenance information may be overwhelming, so there is a need for abstraction mechanisms to help users focus on the most relevant information. The technique we pursue is that of "user views". Since bioinformatics tasks may themselves be complex sub-workflows, a user view determines what level of sub-workflow the user can see, and thus what data and tasks are visible in provenance queries. In this paper, we formalize the notion of user views, demonstrate how they can be used in provenance queries, and give an algorithm for generating a user view based on which tasks are relevant for the user. We then describe our prototype and give performance results. Although presented in the context of scientific workflows, the technique applies to other data-oriented workflows.
  • Keywords
    biology computing; query processing; scientific information systems; workflow management software; abstraction mechanisms; bioinformatics; provenance management; provenance querying; scientific workflow systems; user views; Bioinformatics; Buildings; Databases; Intrusion detection; Large-scale systems; Phylogeny; Proteins; Prototypes; Sequences; Workflow management software;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497516
  • Filename
    4497516