• DocumentCode
    168707
  • Title

    Expanding Tasks of Logical Workflows Into Independent Workflows for Improved Scalability

  • Author

    Hazekamp, Nicholas ; Choudhury, Olivia ; Gesing, Sandra ; Emrich, S. ; Thain, D.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Notre Dame, Notre Dame, IN, USA
  • fYear
    2014
  • fDate
    26-29 May 2014
  • Firstpage
    548
  • Lastpage
    549
  • Abstract
    Workflow management systems, such as Galaxy and Taverna, provide a portal through which data can be processed using a sequence of different tools. This sequence forms a logical workflow that describes the process. However, when the data workload becomes large enough, the time spent in each logical step increases, making it difficult to run the workflow quickly and efficiently. The proposed solution is task-level expansion, which takes each step of the logical workflow and expands it into a new self-contained workflow. These workflows allow for greater scalability and concurrency by creating more tasks. The resulting workflows can be used in the same way as the original tool, but run more quickly and efficiently. The concept was applied to the BWA tool in Galaxy, where we observed a 7.36-times speedup in runtime on our 32 GB dataset.
  • Keywords
    concurrency control; workflow management software; BWA tool; Galaxy; Taverna; concurrency; data processing; independent workflows; logical workflows; portal; scalability improvement; self-contained workflow; task level expansion; workflow management systems; Bioinformatics; Cloud computing; Conferences; Engines; Genomics; Portals; Scalability; bioinformatics; task expansion; workflows;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/CCGrid.2014.84
  • Filename
    6846496
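
The task-level expansion described in the abstract can be illustrated with a minimal sketch: one logical step is replaced by a small split, process, and merge workflow that yields the same result as the original step. This is not the authors' implementation. The function names (`split_records`, `align_chunk`, `expanded_task`, `original_task`) are hypothetical, and `align_chunk` is a trivial stand-in for a real per-chunk tool invocation such as an aligner.

```python
from concurrent.futures import ThreadPoolExecutor


def split_records(records, chunk_size):
    """Split stage: cut the input into independent, ordered chunks."""
    return [records[i:i + chunk_size] for i in range(0, len(records), chunk_size)]


def align_chunk(chunk):
    """Hypothetical stand-in for the tool run on one chunk (assumption)."""
    return [record.upper() for record in chunk]


def original_task(records):
    """The single logical step, run once over the whole input."""
    return align_chunk(records)


def expanded_task(records, chunk_size=2, workers=4):
    """The same step expanded into a split -> concurrent process -> merge workflow."""
    chunks = split_records(records, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in input order, so the merge is a simple concatenation.
        partial_results = list(pool.map(align_chunk, chunks))
    return [record for part in partial_results for record in part]


if __name__ == "__main__":
    reads = ["acgt", "ttga", "ggca", "cat", "g"]
    # The expanded workflow is a drop-in replacement for the original step.
    print(expanded_task(reads) == original_task(reads))
```

The sketch only works because each record can be processed independently of the others; a real tool would also need a merge stage that understands its output format, and the speedup would depend on the chunk size and the number of available workers.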