• DocumentCode
    3329388
  • Title

    STEAMEngine: Driving MapReduce provisioning in the cloud

  • Author

    Cardosa, Michael ; Narang, Piyush ; Chandra, Abhishek ; Pucha, Himabindu ; Singh, Aameek

  • fYear
    2011
  • fDate
    18-21 Dec. 2011
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    MapReduce has gained in popularity as a distributed data analysis paradigm, particularly in the cloud, where MapReduce jobs are run on virtual clusters. The provisioning of MapReduce jobs in the cloud is an important problem for optimizing several user as well as provider-side metrics, such as runtime, cost, throughput, energy, and load. In this paper, we present an intelligent provisioning framework called STEAMEngine that consists of provisioning algorithms to optimize these metrics through a set of common building blocks. These building blocks enable spatio-temporal tradeoffs unique to MapReduce provisioning: along with their resource requirements (spatial component), a MapReduce job runtime (temporal component) is a critical element for any provisioning algorithm. We also describe tw o novel provisioning algorithms - a user-driven performance optimization and a provider-driven energy optimization - that leverage these building blocks. Our experimental results based on an Amazon EC2 cluster and a local Xen/Hadoop cluster show the benefits of STEAMEngine through improvements in performance and energy via the use of these algorithms and building blocks.
  • Keywords
    cloud computing; data analysis; software metrics; software performance evaluation; workstation clusters; Amazon EC2 cluster; Hadoop cluster; MapReduce job runtime; MapReduce provisioning; STEAMEngine; Xen cluster; cloud; distributed data analysis; intelligent provisioning framework; provider-driven energy optimization; provider-side metrics; spatio-temporal tradeoffs; temporal component; user-driven performance optimization; virtual cluster; Cloud computing; Clustering algorithms; Data models; Measurement; Optimization; Runtime; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2011 18th International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4577-1951-6
  • Electronic_ISBN
    978-1-4577-1949-3
  • Type

    conf

  • DOI
    10.1109/HiPC.2011.6152649
  • Filename
    6152649