• DocumentCode
    1759083
  • Title

    Cost-Effective Resource Provisioning for MapReduce in a Cloud

  • Author

    Palanisamy, Balaji ; Singh, Aameek ; Ling Liu

  • Author_Institution
    Sch. of Inf. Sci., Univ. of Pittsburgh, Pittsburgh, PA, USA
  • Volume
    26
  • Issue
    5
  • fYear
    2015
  • fDate
    May 1 2015
  • Firstpage
    1265
  • Lastpage
    1279
  • Abstract
    This paper presents a new MapReduce cloud service model, Cura, for provisioning cost-effective MapReduce services in a cloud. In contrast to existing MapReduce cloud services such as a generic compute cloud or a dedicated MapReduce cloud, Cura has a number of unique benefits. First, Cura is designed to provide a cost-effective solution to efficiently handle MapReduce production workloads that have a significant amount of interactive jobs. Second, unlike existing services that require customers to decide the resources to be used for the jobs, Cura leverages MapReduce profiling to automatically create the best cluster configuration for the jobs. While the existing models allow only a per-job resource optimization for the jobs, Cura implements a globally efficient resource allocation scheme that significantly reduces the resource usage cost in the cloud. Third, Cura leverages unique optimization opportunities when dealing with workloads that can withstand some slack. By effectively multiplexing the available cloud resources among the jobs based on the job requirements, Cura achieves significantly lower resource usage costs for the jobs. Cura´s core resource management schemes include cost-aware resource provisioning, VM-aware scheduling and online virtual machine reconfiguration. Our experimental results using Facebook-like workload traces show that our techniques lead to more than 80 percent reduction in the cloud compute infrastructure cost with upto 65 percent reduction in job response times.
  • Keywords
    cloud computing; cost reduction; parallel programming; resource allocation; scheduling; virtual machines; Cura model; MapReduce cloud service model; MapReduce production workload; MapReduce profiling; MapReduce services; VM-aware scheduling; cost-aware resource provisioning; cost-effective resource provisioning; cost-effective solution; dedicated MapReduce cloud; generic compute cloud; online virtual machine reconfiguration; per-job resource optimization; resource allocation scheme; resource usage cost reduction; Computational modeling; Optimization; Resource management; Schedules; Scheduling; Time factors; MapReduce; cloud computing; cost-effectiveness; scheduling;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2014.2320498
  • Filename
    6805615