DocumentCode :
1927077
Title :
Fault-aware, utility-based job scheduling on Blue, Gene/P systems
Author :
Tang, Wei ; Lan, Zhiling ; Desai, Narayan ; Buettner, Daniel
Author_Institution :
Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
fYear :
2009
fDate :
Aug. 31 2009-Sept. 4 2009
Firstpage :
1
Lastpage :
10
Abstract :
Job scheduling on large-scale systems is an increasingly complicated affair, with numerous factors influencing scheduling policy. Addressing these concerns results in sophisticated scheduling policies that can be difficult to reason about. In this paper, we present a general utility-based scheduling framework to balance various scheduling requirements and priorities. It enables system owners to customize scheduling policies under different circumstances without changing the scheduling code. We also develop a fault-aware job allocation strategy for Blue Gene/P systems to address the increasing concern of system failures. We demonstrate the effectiveness of these facilities by means of event-driven simulations with real job traces collected from the production Blue Gene/P system at Argonne National Laboratory.
Keywords :
mainframes; parallel machines; processor scheduling; Argonne National Laboratory; Blue Gene/P system; fault-aware job allocation strategy; fault-aware scheduling; large-scale system; utility-based job scheduling; Computer science; Delay; Discrete event simulation; Energy consumption; Job production systems; Laboratories; Large-scale systems; Mathematics; Processor scheduling; Resource management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and Workshops, 2009. CLUSTER '09. IEEE International Conference on
Conference_Location :
New Orleans, LA
ISSN :
1552-5244
Print_ISBN :
978-1-4244-5011-4
Electronic_ISBN :
1552-5244
Type :
conf
DOI :
10.1109/CLUSTR.2009.5289206
Filename :
5289206
Link To Document :
بازگشت