DocumentCode
168707
Title
Expanding Tasks of Logical Workflows Into Independent Workflows for Improved Scalability
Author
Hazekamp, Nicholas ; Choudhury, Olivia ; Gesing, Sandra ; Emrich, S. ; Thain, D.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Notre Dame, Notre Dame, IN, USA
fYear
2014
fDate
26-29 May 2014
Firstpage
548
Lastpage
549
Abstract
Workflow management systems, such as Galaxy and Taverna, provide a portal through which data can be processed using a sequence of different tools. This sequence forms a logical workflow that describes the process. However, when the data workload becomes large enough, the time spent in each logical step increases, making it difficult to run the workflow quickly and efficiently. The proposed solution is task-level expansion. Task expansion takes each step of the logical workflow and expands it into a new self-contained workflow. These workflows allow for greater scalability and concurrency by creating more tasks. The resulting workflows can be used indistinguishably from the original tool, but run more quickly and efficiently. The concept was applied to the BWA tool in Galaxy, where we observed a 7.36-fold speedup in runtime on our 32 GB dataset.
Keywords
concurrency control; workflow management software; BWA tool; Galaxy; Taverna; concurrency; data processing; independent workflows; logical workflows; portal; scalability improvement; self-contained workflow; task level expansion; workflow management systems; Bioinformatics; Cloud computing; Conferences; Engines; Genomics; Portals; Scalability; bioinformatics; task expansion; workflows;
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location
Chicago, IL
Type
conf
DOI
10.1109/CCGrid.2014.84
Filename
6846496