• DocumentCode
    2862907
  • Title

    Energy Prediction for MapReduce Workloads

  • Author

    Li, Wenjun ; Yang, Hailong ; Luan, Zhongzhi ; Qian, Depei

  • Author_Institution
    State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
  • fYear
    2011
  • fDate
    12-14 Dec. 2011
  • Firstpage
    443
  • Lastpage
    448
  • Abstract
    Energy efficiency of data centers has attracted wide research attention with growing concern for power consumption and heat dissipation. Map Reduce as an efficient programming model for data-intensive computing is increasingly popular among industrial companies and academic organizations. As Map Reduce is developed specifically to process large-scale data analysis, its impact on energy efficiency of data centers has not been well scrutinized. Recently some energy conserving strategies have been proposed to reduce the overall power consumption of Map Reduce clusters. The fundamental ideas of previous work can be summarized as scaling down working nodes and reducing execution time. However, there are few researches on energy prediction for Map Reduce workloads, which can offer guide for cluster administrator to make power budget or schedule workloads to clusters with different power budget, and be useful for monitoring workloads´ energy consumption. In this paper, we identify several workload metrics that have strong correlations with energy consumption. We use multivariate linear regression to analyze these metrics, and then construct a prediction model. Regression diagnosis is performed intensively to optimize the prediction model. After applying to the Word Count and Sort workloads with various input size, we find our prediction model is highly accurate with 0.12% and 0.15% inaccuracy compared to the observed energy consumption in the best and worst cases.
  • Keywords
    computer centres; cooling; data analysis; distributed processing; power consumption; regression analysis; MapReduce workloads; academic organizations; data centers; data-intensive computing; energy efficiency; energy prediction; heat dissipation; industrial companies; large-scale data analysis; multivariate linear regression; power consumption; Energy consumption; Energy efficiency; Equations; Linear regression; Mathematical model; Measurement; Predictive models; MapReduce; data intensive computing; energy prediction; multivariate linear regression; regression diagnosis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4673-0006-3
  • Type

    conf

  • DOI
    10.1109/DASC.2011.88
  • Filename
    6118750