• DocumentCode
    228708
  • Title

    Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems

  • Author

    Sengupta, Dipak ; Goswami, Anshuman ; Schwan, Karsten ; Pallavi, Krishna

  • Author_Institution
    Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    513
  • Lastpage
    524
  • Abstract
    Accelerator-based systems are making rapid inroads into becoming platforms of choice for high end cloud services. There is a need therefore, to move from the current model in which high performance applications explicitly and programmatically select the GPU devices on which to run, to a dynamic model where GPUs are treated as first class schedulable entities. The Strings scheduler realizes this vision by decomposing the GPU scheduling problem into a combination of load balancing and per-device scheduling. (i) Device-level scheduling efficiently uses all of a GPU´s hardware resources, including its computational and data movement engines, and (ii) load balancing goes beyond obtaining high throughput, to ensure fairness through prioritizing GPU requests that have attained least service. With its methods, Strings achieves improvements in system throughput and fairness of up to 8.70× and 13%, respectively, compared to the CUDA runtime.
  • Keywords
    cloud computing; graphics processing units; parallel processing; resource allocation; scheduling; GPU scheduling problem; accelerator-based systems; data movement engines; device-level scheduling; dynamic model; high end cloud services; high performance applications; load balancing; multitenant cloud workload scheduling; per-device scheduling; strings scheduler; Context; Graphics processing units; Processor scheduling; Runtime; Servers; Switches; Synchronization; CUDA; GPU; Multi-tenancy; hierarchical scheduling; runtime systems; virtualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.47
  • Filename
    7013029