• DocumentCode
    3600604
  • Title

    A Service Framework for Scientific Workflow Management in the Cloud

  • Author

    Yong Zhao ; Youfu Li ; Raicu, Ioan ; Shiyong Lu ; Cui Lin ; Yanzhe Zhang ; Wenhong Tian ; Ruini Xue

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    8
  • Issue
    6
  • fYear
    2015
  • Firstpage
    930
  • Lastpage
    944
  • Abstract
    Cloud computing is an emerging computing paradigm that can offer unprecedented scalability and resources on demand, and is getting more and more adoption in the science community, while scientific workflow management systems provide essential support such as management of data and task dependencies, job scheduling and execution, provenance tracking, etc., to scientific computing. As we are entering into a “big data” era, it is imperative to migrate scientific workflow management systems into the cloud to manage the ever increasing data scale and analysis complexity. We propose a reference service framework for integrating scientific workflow management systems into various cloud platforms, which consists of eight major components, including Cloud Workflow Management Service, Cloud Resource Manager, etc., and six interfaces between them. We also present a reference framework for the implementation of Cloud Resource Manager, which is responsible for the provisioning and management of virtual resources in the cloud. We discuss our implementation of the framework by integrating the Swift scientific workflow management system with the OpenNebula and Eucalyptus cloud platforms, and demonstrate the capability of the solution using a NASA MODIS image processing workflow and a production deployment on the Science@Guoshi network with support for the Montage image mosaic workflow.
  • Keywords
    Big Data; cloud computing; data analysis; image processing; natural sciences computing; resource allocation; workflow management software; Eucalyptus cloud platform; Montage image mosaic workflow; NASA MODIS image processing workflow; OpenNebula cloud platform; Science@Guoshi network; Swift scientific workflow management system; big data era; cloud computing; cloud resource manager; cloud workflow management service; data analysis complexity; data management; data scale; job scheduling; production deployment; provenance tracking; reference service framework; science community; scientific computing; scientific workflow management systems; task dependencies; virtual resource management; virtual resource provisioning; Cloud computing; Computer architecture; Processor scheduling; Resource management; Scalability; Virtual machining; Cloud workflow; cloud resource management; reference service framework; swift; virtual cluster provisioning; workflow-as-a-service;
  • fLanguage
    English
  • Journal_Title
    Services Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1939-1374
  • Type

    jour

  • DOI
    10.1109/TSC.2014.2341235
  • Filename
    6872792