• DocumentCode
    1804682
  • Title

    Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimality

  • Author

    Qiaomin Xie ; Yi Lu

  • Author_Institution
    Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2015
  • fDate
    April 26 2015-May 1 2015
  • Firstpage
    963
  • Lastpage
    972
  • Abstract
    The prevalence of data-parallel applications has made near-data scheduling an important problem. An example is the map task scheduling in the map-reduce framework. Wang et. al. [13] was the first to identify its capacity region and proposed a throughput-optimal algorithm based on MaxWeight. However, the study of the algorithm´s delay performance revealed that it is only heavy-traffic optimal for a very special traffic scenario, where all traffic concentrates on a subset of servers. We propose a simple “local-tasks first” priority algorithm and show that it is throughput-optimal and heavy-traffic optimal for all traffic scenarios, i.e., it asymptotically minimizes the average delay as the arrival rate vector approaches the boundary of the capacity region. So far, it is the only known heavy-traffic optimal algorithm for this setting. As the algorithm is based on pre-determined priority, a direct application of the Lyapunov drift technique does not work. The main proof ideas are the construction of an ideal load decomposition and the separate treatment of two subsystems based on their ideal load. To the best of our knowledge, this is the only setup of affinity scheduling where a simple priority algorithm is shown to be heavy-traffic optimal. Simulation shows that our algorithm also significantly outperforms existing algorithms at loads away from the boundary of the capacity region.
  • Keywords
    telecommunication scheduling; telecommunication traffic; vectors; MaxWeight; affinity scheduling; arrival rate vector; capacity region boundary; data-parallel applications; delay performance; heavy-traffic optimal algorithm; ideal load decomposition; local-tasks first priority algorithm; map task scheduling; map-reduce framework; near-data scheduling; predetermined priority; throughput-optimal algorithm; very special traffic scenario; Computers; Conferences; Delays; Job shop scheduling; Load modeling; Servers; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications (INFOCOM), 2015 IEEE Conference on
  • Conference_Location
    Kowloon
  • Type

    conf

  • DOI
    10.1109/INFOCOM.2015.7218468
  • Filename
    7218468