DocumentCode
659405
Title
HFSP: Size-based scheduling for Hadoop
Author
Pastorelli, Michele ; Barbuzzi, Antonio ; Carra, Damiano ; Dell´Amico, Matteo ; Michiardi, Pietro
Author_Institution
EURECOM, Sophia-Antipolis, France
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
51
Lastpage
59
Abstract
Size-based scheduling with aging has, for long, been recognized as an effective approach to guarantee fairness and near-optimal system response times. We present HFSP, a scheduler introducing this technique to a real, multi-server, complex and widely used system such as Hadoop. Size-based scheduling requires a priori job size information, which is not available in Hadoop: HFSP builds such knowledge by estimating it on-line during job execution. Our experiments, which are based on realistic workloads generated via a standard benchmarking suite, pinpoint at a significant decrease in system response times with respect to the widely used Hadoop Fair scheduler, and show that HFSP is largely tolerant to job size estimation errors.
Keywords
distributed processing; resource allocation; scheduling; HFSP scheduler; Hadoop Fair scheduler; fairness; job execution; job size estimation errors; job size information; near-optimal system response times; size-based scheduling; Abstracts; Aging; Estimation error; Processor scheduling; Schedules; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691554
Filename
6691554
Link To Document