DocumentCode
228708
Title
Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems
Author
Sengupta, Dipak ; Goswami, Anshuman ; Schwan, Karsten ; Pallavi, Krishna
Author_Institution
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2014
fDate
16-21 Nov. 2014
Firstpage
513
Lastpage
524
Abstract
Accelerator-based systems are making rapid inroads into becoming platforms of choice for high end cloud services. There is a need therefore, to move from the current model in which high performance applications explicitly and programmatically select the GPU devices on which to run, to a dynamic model where GPUs are treated as first class schedulable entities. The Strings scheduler realizes this vision by decomposing the GPU scheduling problem into a combination of load balancing and per-device scheduling. (i) Device-level scheduling efficiently uses all of a GPU´s hardware resources, including its computational and data movement engines, and (ii) load balancing goes beyond obtaining high throughput, to ensure fairness through prioritizing GPU requests that have attained least service. With its methods, Strings achieves improvements in system throughput and fairness of up to 8.70× and 13%, respectively, compared to the CUDA runtime.
Keywords
cloud computing; graphics processing units; parallel processing; resource allocation; scheduling; GPU scheduling problem; accelerator-based systems; data movement engines; device-level scheduling; dynamic model; high end cloud services; high performance applications; load balancing; multitenant cloud workload scheduling; per-device scheduling; strings scheduler; Context; Graphics processing units; Processor scheduling; Runtime; Servers; Switches; Synchronization; CUDA; GPU; Multi-tenancy; hierarchical scheduling; runtime systems; virtualization;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
Conference_Location
New Orleans, LA
Print_ISBN
978-1-4799-5499-5
Type
conf
DOI
10.1109/SC.2014.47
Filename
7013029
Link To Document