• DocumentCode
    172895
  • Title

    Exploiting User Patience for Scaling Resource Capacity in Cloud Services

  • Author

    Cunha, Renato L. F. ; Assuncao, Marcos D. ; Cardonha, Carlos ; Netto, Marco A. S.

  • Author_Institution
    IBM Res., São Paulo, Brazil
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    448
  • Lastpage
    455
  • Abstract
    An important feature of cloud computing is its elasticity, that is, the ability to have resource capacity dynamically modified according to the current system load. Auto-scaling is challenging because it must account for two conflicting objectives: minimising system capacity available to users and maximising QoS, which typically translates to short response times. Current auto-scaling techniques are based solely on load forecasts and ignore the perception that users have from cloud services. As a consequence, providers tend to provision a volume of resources that is significantly larger than necessary to keep users satisfied. In this article, we propose a scheduling algorithm and an auto-scaling triggering technique that explore user patience in order to identify critical times when auto-scaling is needed and the appropriate volume of capacity by which the cloud platform should either extend or shrink. The proposed technique assists service providers in reducing costs related to resource allocation while keeping the same QoS to users. Our experiments show that it is possible to reduce resource-hour by up to approximately 8% compared to auto-scaling based on system utilisation.
  • Keywords
    cloud computing; resource allocation; QoS maximisation; auto-scaling triggering technique; cloud computing; cloud platform; cloud services; costs reduction; load forecasts; resource allocation; resource capacity scaling; resources provision; scheduling algorithm; service providers; system capacity minimisation; system load; system utilisation; user patience; Cloud computing; Quality of service; Resource management; Scheduling; Scheduling algorithms; Time factors; Auto-scaling; Cloud Computing; Elasticity; User Patience;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5062-1
  • Type

    conf

  • DOI
    10.1109/CLOUD.2014.67
  • Filename
    6973773