DocumentCode
1721974
Title
Taming complex bioinformatics workflows with weaver, makeflow, and starch
Author
Thrasher, Andrew ; Carmichael, Rory ; Bui, Peter ; Yu, Li ; Thain, Douglas ; Emrich, Scott
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Notre Damem, Notre Damem, IN, USA
fYear
2010
Firstpage
1
Lastpage
6
Abstract
In this paper we discuss challenges of common bioinformatics applications when deployed outside their initial development environments. We propose a three-tiered approach to mitigate some of these issues by leveraging an encapsulation tool, a high-level workflow language, and a portable intermediary. As a case study, we apply this approach to refactor a custom EST analysis pipeline. The Starch tool encapsulates program dependencies to simplify task specification and deployment. The Weaver language provides abstractions for distributed computing and naturally encourages code modularity. The Makeflow workflow engine provides a batch system agnostic engine to execute compiled Weaver code. To illustrate the benefits of our framework, we compare implementations, show their performance, and discuss benefits derived from our new workflow approach relative to traditional bioinformatics development.
Keywords
bioinformatics; programming languages; software tools; EST analysis pipeline; bioinformatics workflows; makeflow workflow engine; starch tool; weaver language; Bioinformatics; Encapsulation; Engines; Libraries; Pipelines; Runtime; Weaving;
fLanguage
English
Publisher
ieee
Conference_Titel
Workflows in Support of Large-Scale Science (WORKS), 2010 5th Workshop on
Conference_Location
New Orleans, LA
ISSN
2151-1373
Print_ISBN
978-1-4244-8989-3
Electronic_ISBN
2151-1373
Type
conf
DOI
10.1109/WORKS.2010.5671858
Filename
5671858
Link To Document