• DocumentCode
    1915955
  • Title

    Deploying Bioinformatics Workflows on Clouds with Galaxy and Globus Provision

  • Author

    Bo Liu ; Sotomayor, Borja ; Madduri, Ravi ; Chard, Kyle ; Foster, Ian

  • Author_Institution
    Comput. Inst., Univ. of Chicago, Chicago, IL, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1087
  • Lastpage
    1095
  • Abstract
    Cloud computing is attracting increasing attention as a means of providing users with fast provisioning of computational and storage resources, elastic scaling, and payas-you-go pricing. The integration of scientific workflows and Cloud computing has the potential to significantly improve resource utilization, processing speed, and user experience. This paper proposes a novel approach for deploying bioinformatics workflows in Cloud environments using Galaxy, a platform for scientific workflows, and Globus Provision, a tool for deploying distributed computing clusters on Amazon EC2. Collectively this combination of tools provides an easy to use, high performance and scalable workflow environment that addresses the needs of data-intensive applications through dynamic cluster configuration, automatic user-defined node provisioning, high speed data transfer, and automated deployment and configuration of domain-specific software. To demonstrate how this approach can be used in practice we present a domain-specific workflow use case and performance evaluation.
  • Keywords
    bioinformatics; distributed processing; resource allocation; workflow management software; Amazon EC2; bioinformatics workflow; cloud computing; computational resource; data transfer; distributed computing cluster; domain-specific software; dynamic cluster configuration; elastic scaling; galaxy provision; globus provision; pay-as-you-go pricing; processing speed; resource utilization; scientific workflow; storage resource; user experience; user-defined node provisioning; Cloud computing; Galaxy; Globus provision; Scientific workflow;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type

    conf

  • DOI
    10.1109/SC.Companion.2012.131
  • Filename
    6495913