• DocumentCode
    2845830
  • Title

    Realistic Models and Efficient Algorithms for Fault Tolerant Scheduling on Heterogeneous Platforms

  • Author

    Benoit, Anne ; Hakem, Mourad ; Robert, Yves

  • Author_Institution
    LIP Lab., Univ. de Lyon, Lyon
  • fYear
    2008
  • fDate
    9-12 Sept. 2008
  • Firstpage
    246
  • Lastpage
    253
  • Abstract
    Most list scheduling heuristics rely on a simple platform model where communication contention is not taken into account. In addition, it is generally assumed that processors in the systems are completely safe. To schedule precedence graphs in a more realistic framework, we introduce an efficient fault tolerant scheduling algorithm that is both contention-aware and capable of supporting ε arbitrary fail-silent/fail-stop processor failures. We focus on a bi-criteria approach, where we aim at minimizing the total execution time, or latency, given a fixed number of failures supported in the system. Our algorithm has a low time complexity, and drastically reduces the number of additional communications induced by the replication mechanism. Experimental results fully demonstrate the usefulness of the proposed algorithm, which leads to efficient execution schemes while guaranteeing a prescribed level of fault tolerance.
  • Keywords
    fault tolerant computing; processor scheduling; fail-silent/fail-stop processor failures; fault tolerant scheduling; heterogeneous platforms; list scheduling heuristics; precedence graphs; Communication networks; Delay; Fault tolerance; Laboratories; Large-scale systems; Parallel processing; Processor scheduling; Scheduling algorithm; Communication contention; fault tolerance; heterogeneous systems; multi-criteria scheduling
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel Processing, 2008. ICPP '08. 37th International Conference on
  • Conference_Location
    Portland, OR
  • ISSN
    0190-3918
  • Print_ISBN
    978-0-7695-3374-2
  • Electronic_ISBN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2008.22
  • Filename
    4625856