• DocumentCode
    169125
  • Title

    Modeling the Impact of Workload on Cloud Resource Scaling

  • Author

    Gandhi, Anshul ; Dube, Parijat ; Karve, Alexei ; Kochut, Andrzej ; Li Zhang

  • Author_Institution
    T.J. Watson Res. Center, IBM, Yorktown Heights, NY, USA
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    310
  • Lastpage
    317
  • Abstract
    Cloud computing offers the flexibility to dynamically size the infrastructure in response to changes in workload demand. While both horizontal and vertical scaling of infrastructure is supported by major cloud providers, these scaling options differ significantly in terms of their cost, provisioning time, and their impact on workload performance. Importantly, the efficacy of horizontal and vertical scaling critically depends on the workload characteristics, such as the workload´s parallelizability and its core scalability. In today´s cloud systems, the scaling decision is left to the users, requiring them to fully understand the tradeoffs associated with the different scaling options. In this paper, we present our solution for optimizing the resource scaling of cloud deployments via implementation in OpenStack. The key component of our solution is the modelling engine that characterizes the workload and then quantitatively evaluates different scaling options for that workload. Our modelling engine leverages Amdahl´s Law to model service time scaling in scaleup environments and queueing-theoretic concepts to model performance scaling in scale-out environments. We further employ Kalman filtering to account for inaccuracies in the model-based methodology, and to dynamically track changes in the workload and cloud environment.
  • Keywords
    Kalman filters; cloud computing; queueing theory; resource allocation; Amdahl´s Law; Kalman filtering; OpenStack; cloud computing; cloud deployments; cloud providers; cloud resource scaling; cloud systems; core scalability; horizontal infrastructure scaling; model-based methodology; modeling engine; queueing-theoretic concepts; vertical infrastructure scaling; workload demand; workload impact modeling; workload parallelizability; workload performance; Engines; Equations; Kalman filters; Load modeling; Mathematical model; Monitoring; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing (SBAC-PAD), 2014 IEEE 26th International Symposium on
  • Conference_Location
    Jussieu
  • ISSN
    1550-6533
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2014.16
  • Filename
    6970679