• DocumentCode
    1721974
  • Title

    Taming complex bioinformatics workflows with weaver, makeflow, and starch

  • Author

    Thrasher, Andrew ; Carmichael, Rory ; Bui, Peter ; Yu, Li ; Thain, Douglas ; Emrich, Scott

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Notre Damem, Notre Damem, IN, USA
  • fYear
    2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper we discuss challenges of common bioinformatics applications when deployed outside their initial development environments. We propose a three-tiered approach to mitigate some of these issues by leveraging an encapsulation tool, a high-level workflow language, and a portable intermediary. As a case study, we apply this approach to refactor a custom EST analysis pipeline. The Starch tool encapsulates program dependencies to simplify task specification and deployment. The Weaver language provides abstractions for distributed computing and naturally encourages code modularity. The Makeflow workflow engine provides a batch system agnostic engine to execute compiled Weaver code. To illustrate the benefits of our framework, we compare implementations, show their performance, and discuss benefits derived from our new workflow approach relative to traditional bioinformatics development.
  • Keywords
    bioinformatics; programming languages; software tools; EST analysis pipeline; bioinformatics workflows; makeflow workflow engine; starch tool; weaver language; Bioinformatics; Encapsulation; Engines; Libraries; Pipelines; Runtime; Weaving;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Workflows in Support of Large-Scale Science (WORKS), 2010 5th Workshop on
  • Conference_Location
    New Orleans, LA
  • ISSN
    2151-1373
  • Print_ISBN
    978-1-4244-8989-3
  • Electronic_ISBN
    2151-1373
  • Type

    conf

  • DOI
    10.1109/WORKS.2010.5671858
  • Filename
    5671858