• DocumentCode
    659405
  • Title

    HFSP: Size-based scheduling for Hadoop

  • Author

    Pastorelli, Michele ; Barbuzzi, Antonio ; Carra, Damiano ; Dell´Amico, Matteo ; Michiardi, Pietro

  • Author_Institution
    EURECOM, Sophia-Antipolis, France
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    51
  • Lastpage
    59
  • Abstract
    Size-based scheduling with aging has, for long, been recognized as an effective approach to guarantee fairness and near-optimal system response times. We present HFSP, a scheduler introducing this technique to a real, multi-server, complex and widely used system such as Hadoop. Size-based scheduling requires a priori job size information, which is not available in Hadoop: HFSP builds such knowledge by estimating it on-line during job execution. Our experiments, which are based on realistic workloads generated via a standard benchmarking suite, pinpoint at a significant decrease in system response times with respect to the widely used Hadoop Fair scheduler, and show that HFSP is largely tolerant to job size estimation errors.
  • Keywords
    distributed processing; resource allocation; scheduling; HFSP scheduler; Hadoop Fair scheduler; fairness; job execution; job size estimation errors; job size information; near-optimal system response times; size-based scheduling; Abstracts; Aging; Estimation error; Processor scheduling; Schedules; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691554
  • Filename
    6691554