DocumentCode :
249301
Title :
Energy-Aware Scheduling of MapReduce Jobs
Author :
Mashayekhy, Lena ; Nejad, Mahyar Movahed ; Grosu, Daniel ; Dajun Lu ; Weisong Shi
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear :
2014
fDate :
June 27 2014-July 2 2014
Firstpage :
32
Lastpage :
39
Abstract :
The majority of large-scale data intensive applications executed by data centers are based on MapReduce or its open-source implementation, Hadoop. Such applications are executed on large clusters requiring large amounts of energy, making the energy costs a large fraction of the data center´s overall costs. Therefore minimizing the energy consumption when executing MapReduce jobs is a critical concern for data centers. In this paper, we propose a framework for improving the energy efficiency of MapReduce applications, while satisfying the service level agreement (SLA). We first model the problem of energy-aware scheduling of MapReduce jobs as an Integer Program. We then propose a greedy algorithm, called Energy-aware MapReduce Scheduling Algorithm (EMRSA), that finds the assignments of map and reduce tasks to the machine slots in order to minimize the energy consumed when executing the application. We perform experiments on a large Hadoop cluster to determine the energy consumption of several MapReduce benchmark applications, and then use this data in an extensive simulation study to characterize the performance of the proposed algorithm. The results show that EMRSA is able to find job schedules consuming 40% less energy on average than the schedules obtained by a common practice scheduler that minimizes the makespan.
Keywords :
contracts; data analysis; energy conservation; energy consumption; greedy algorithms; integer programming; power aware computing; scheduling; EMRSA; Hadoop cluster; MapReduce jobs scheduling; SLA; energy consumption; energy efficiency; energy-aware MapReduce scheduling algorithm; greedy algorithm; integer program; machine slots; service level agreement; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Energy consumption; Optimized production technology; Schedules; Scheduling algorithms; MapReduce; big data; minimizing energy consumption; scheduling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
Type :
conf
DOI :
10.1109/BigData.Congress.2014.15
Filename :
6906758
Link To Document :
بازگشت