• DocumentCode
    2996887
  • Title

    A Combined Dual-stage Framework for Robust Scheduling of Scientific Applications in Heterogeneous Environments with Uncertain Availability

  • Author

    Ciorba, Florina M. ; Hansen, Timothy ; Srivastava, Srishti ; Banicescu, Ioana ; Maciejewski, Anthony A. ; Siegel, Howard Jay

  • Author_Institution
    Center for Inf. Services & High Performance Comput., Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    193
  • Lastpage
    207
  • Abstract
    Scheduling parallel applications on existing or emerging computing platforms is challenging, and, among other attributes, must be efficient and robust. A dual-stage framework is proposed in this paper to evaluate the robustness of efficient resource allocation and dynamic load balancing of scientific applications in heterogeneous computing environments with uncertain availability. The first stage employs robust resource allocation heuristics, while the second stage incorporates robust dynamic loop scheduling techniques. The combined dual-stage framework constitutes a comprehensive framework that enables and provides guarantees for the robust execution of scientific applications in computing systems where uncertainty is caused by various unpredictable perturbations. The paper reports on studies for determining the best techniques to be used for each stage that: (a) maximize the probability that the system make span satisfies a deadline, and (b) minimize the system make span for every given availability level in the system. The usefulness and benefits of the proposed framework are demonstrated via a small scale example.
  • Keywords
    natural sciences computing; parallel processing; probability; resource allocation; scheduling; combined dual-stage framework; dynamic load balancing; heterogeneous computing environments; parallel application scheduling; probability maximization; robust dynamic loop scheduling techniques; robust resource allocation heuristics; scientific applications; system make span minimization; uncertain availability; Availability; Dynamic scheduling; Program processors; Resource management; Robustness; Runtime; Uncertainty; dynamic loop scheduling; heterogeneous systems; high performance; non-dedicated systems; resource allocation; robustness; uncertainties;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2012.5
  • Filename
    6270639