DocumentCode :
2862907
Title :
Energy Prediction for MapReduce Workloads
Author :
Li, Wenjun ; Yang, Hailong ; Luan, Zhongzhi ; Qian, Depei
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
fYear :
2011
fDate :
12-14 Dec. 2011
Firstpage :
443
Lastpage :
448
Abstract :
Energy efficiency of data centers has attracted wide research attention with growing concern for power consumption and heat dissipation. Map Reduce as an efficient programming model for data-intensive computing is increasingly popular among industrial companies and academic organizations. As Map Reduce is developed specifically to process large-scale data analysis, its impact on energy efficiency of data centers has not been well scrutinized. Recently some energy conserving strategies have been proposed to reduce the overall power consumption of Map Reduce clusters. The fundamental ideas of previous work can be summarized as scaling down working nodes and reducing execution time. However, there are few researches on energy prediction for Map Reduce workloads, which can offer guide for cluster administrator to make power budget or schedule workloads to clusters with different power budget, and be useful for monitoring workloads´ energy consumption. In this paper, we identify several workload metrics that have strong correlations with energy consumption. We use multivariate linear regression to analyze these metrics, and then construct a prediction model. Regression diagnosis is performed intensively to optimize the prediction model. After applying to the Word Count and Sort workloads with various input size, we find our prediction model is highly accurate with 0.12% and 0.15% inaccuracy compared to the observed energy consumption in the best and worst cases.
Keywords :
computer centres; cooling; data analysis; distributed processing; power consumption; regression analysis; MapReduce workloads; academic organizations; data centers; data-intensive computing; energy efficiency; energy prediction; heat dissipation; industrial companies; large-scale data analysis; multivariate linear regression; power consumption; Energy consumption; Energy efficiency; Equations; Linear regression; Mathematical model; Measurement; Predictive models; MapReduce; data intensive computing; energy prediction; multivariate linear regression; regression diagnosis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4673-0006-3
Type :
conf
DOI :
10.1109/DASC.2011.88
Filename :
6118750
Link To Document :
بازگشت