• DocumentCode
    2016490
  • Title

    Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids

  • Author

    Zhang, Yang ; Mandal, Anirban ; Koelbel, Charles ; Cooper, Keith

  • Author_Institution
    Dept. of Comput. Sci., Rice Univ., Houston, TX
  • fYear
    2009
  • fDate
    18-21 May 2009
  • Firstpage
    244
  • Lastpage
    251
  • Abstract
    Complex scientific workflows are now Increasingly executed on computational grids. In addition to the challenges of managing and scheduling these workflows, reliability challenges arise because of the unreliable nature of large-scale grid infrastructure. Fault tolerance mechanisms like over-provisioning and checkpoint-recovery are used in current grid application management systems to address these reliability challenges. In this work, we propose new approaches that combine these fault tolerance techniques with existing workflow scheduling algorithms. We present a study on the effectiveness of the combined approaches by analyzing their impact on the reliability of workflow execution, workflow performance and resource usage under different reliability models, failure prediction accuracies and workflow application types.
  • Keywords
    fault tolerant computing; grid computing; complex scientific workflows; computational grids; failure prediction; fault tolerance techniques; grid application management systems; grid infrastructure; reliability models; scheduling techniques; workflow applications; workflow scheduling algorithms; Computer applications; Failure analysis; Fault tolerance; Fault tolerant systems; Grid computing; Large-scale systems; Performance analysis; Predictive models; Processor scheduling; Scheduling algorithm; Fault Tolerant; Grid Computing; Performance Modeling; Workflow Scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-3935-5
  • Electronic_ISBN
    978-0-7695-3622-4
  • Type

    conf

  • DOI
    10.1109/CCGRID.2009.59
  • Filename
    5071878