• DocumentCode
    1803259
  • Title

    Enabling ScientificWorkflow Reuse through Structured Composition of Dataflow and Control-Flow

  • Author

    Bowers, Shawn ; Ludäscher, Bertram ; Ngu, Anne H H ; Critchlow, Terence

  • Author_Institution
    University of California, Davis
  • fYear
    2006
  • fDate
    2006
  • Firstpage
    70
  • Lastpage
    70
  • Abstract
    Data-centric scientific workflows are often modeled as dataflow process networks. The simplicity of the dataflow framework facilitates workflow design, analysis, and optimization. However, modeling "control-flow intensive" tasks using dataflow constructs often leads to overly complicated workflows that are hard to comprehend, reuse, and maintain. We describe a generic framework, based on scientific workflow templates and frames, for embedding control-flow intensive subtasks within dataflow process networks. This approach can seamlessly handle complex control-flow without sacrificing the benefits of dataflow. We illustrate our approach with a real-world scientific workflow from the astrophysics domain, requiring remote execution and file transfer in a semi-reliable environment. For such workflows, we also describe a 3-layered architecture based on frames and templates where the top-layer consists of an overall dataflow process network, the second layer consists of a tranducer template for modeling the desired control-flow behavior, and the bottom layer consists of frames inside the template that are specialized by embedding the desired component implementation. Our approach can enable scientific workflows that are more robust (faulttolerance strategies can be defined by control-flow driven transducer templates) and at the same time more reusable, since the embedding of frames and templates yields more structured and modular workflow designs.
  • Keywords
    Bioinformatics; Computer science; Data engineering; Data structures; Genomics; Laboratories; Mathematical model; Processor scheduling; Robust control; Scientific computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops, 2006. Proceedings. 22nd International Conference on
  • Conference_Location
    Atlanta, GA, USA
  • Print_ISBN
    0-7695-2571-7
  • Type

    conf

  • DOI
    10.1109/ICDEW.2006.55
  • Filename
    1623865