Title :
A Combined Dual-stage Framework for Robust Scheduling of Scientific Applications in Heterogeneous Environments with Uncertain Availability
Author :
Ciorba, Florina M. ; Hansen, Timothy ; Srivastava, Srishti ; Banicescu, Ioana ; Maciejewski, Anthony A. ; Siegel, Howard Jay
Author_Institution :
Center for Inf. Services & High Performance Comput., Tech. Univ. Dresden, Dresden, Germany
Abstract :
Scheduling parallel applications on existing or emerging computing platforms is challenging, and, among other attributes, must be efficient and robust. A dual-stage framework is proposed in this paper to evaluate the robustness of efficient resource allocation and dynamic load balancing of scientific applications in heterogeneous computing environments with uncertain availability. The first stage employs robust resource allocation heuristics, while the second stage incorporates robust dynamic loop scheduling techniques. The combined dual-stage framework constitutes a comprehensive framework that enables and provides guarantees for the robust execution of scientific applications in computing systems where uncertainty is caused by various unpredictable perturbations. The paper reports on studies for determining the best techniques to be used for each stage that: (a) maximize the probability that the system make span satisfies a deadline, and (b) minimize the system make span for every given availability level in the system. The usefulness and benefits of the proposed framework are demonstrated via a small scale example.
Keywords :
natural sciences computing; parallel processing; probability; resource allocation; scheduling; combined dual-stage framework; dynamic load balancing; heterogeneous computing environments; parallel application scheduling; probability maximization; robust dynamic loop scheduling techniques; robust resource allocation heuristics; scientific applications; system make span minimization; uncertain availability; Availability; Dynamic scheduling; Program processors; Resource management; Robustness; Runtime; Uncertainty; dynamic loop scheduling; heterogeneous systems; high performance; non-dedicated systems; resource allocation; robustness; uncertainties;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
DOI :
10.1109/IPDPSW.2012.5