DocumentCode :
114121
Title :
Evaluation of approaches for power estimation in a computing cluster
Author :
Hao Zhu ; Grosso, Paola ; Xiangke Liao ; de Laat, Cees
Author_Institution :
Syst. & Network Eng. Res. group, Univ. of Amsterdam, Amsterdam, Netherlands
fYear :
2014
fDate :
3-5 Nov. 2014
Firstpage :
1
Lastpage :
10
Abstract :
Data centers as a cost-effective infrastructure for hosting Cloud and Grid applications incur tremendous energy cost and C02 emissions in terms of power distribution and cooling. One of the effective approaches for saving energy in a cluster environment is workload consolidation. However, it is challenging to address this schedule problem as it requires the understanding of various cost factors. One of the important factors is the estimation of power consumption. Power models used in most of workload schedule solutions are a linear function of resource features, but we analysed the measurement data from our cluster and found the resource loads, in particular I/O load, had no convincing linear-correlation with power consumption. Based on measurement data sets from our cluster, we propose multiple non-linear machine learning approaches to estimate power consumption of an entire node using OS-reported resource features. We evaluate the accuracy, portability and usability of the linear and non-linear approaches. Our work shows the multiple-variable linear regression approach is more precise than the CPU only linear approach. The neural network approaches have a slight advantage - its mean root mean square error is at most 15% less than that of the multiple-variable linear approach. But the neural network models have worse portability when the models generated on a node are applied on its homogeneous nodes. Gaussian Mixture Model has the highest accuracy on Hadoop nodes but requires the longest training time.
Keywords :
Gaussian processes; computer centres; input-output programs; learning (artificial intelligence); mean square error methods; mixture models; neural nets; power aware computing; regression analysis; scheduling; software portability; Gaussian mixture Model; Hadoop nodes; I/O load; OS-reported resource features; accuracy evaluation; carbon dioxide emissions; cloud applications; computing cluster; cooling; cost factors; data centers; energy cost; energy saving; grid applications; mean root mean square error; measurement data sets; multiple nonlinear machine learning approaches; multiple-variable linear regression approach; neural network approaches; portability evaluation; power consumption estimation; power distribution; power estimation approaches; power models; source loads; training time; usability evaluation; workload consolidation; workload schedule; Biological system modeling; Computational modeling; Estimation; Google; Neurons; Power demand; Training; Energy Monitoring; Machine learning; Power Model; Workload Characterization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Green Computing Conference (IGCC), 2014 International
Conference_Location :
Dallas, TX
Type :
conf
DOI :
10.1109/IGCC.2014.7039145
Filename :
7039145
Link To Document :
بازگشت