• DocumentCode
    2194229
  • Title

    Predictive Data Grouping and Placement for Cloud-Based Elastic Server Infrastructures

  • Author

    Tirado, Juan M. ; Higuero, Daniel ; Isaila, Florin ; Carretero, Jesús

  • Author_Institution
    Comput. Sci. & Eng. Dept., Carlos III Univ. of Madrid, Leganes, Spain
  • fYear
    2011
  • fDate
    23-26 May 2011
  • Firstpage
    285
  • Lastpage
    294
  • Abstract
    Workload variations on Internet platforms such as YouTube, Flickr, LastFM require novel approaches to dynamic resource provisioning in order to meet QoS requirements, while reducing the Total Cost of Ownership (TCO) of the infrastructures. The economy of scale promise of cloud computing is a great opportunity to approach this problem, by developing elastic large scale server infrastructures. However, a proactive approach to dynamic resource provisioning requires prediction models forecasting future load patterns. On the other hand, unexpected volume and data spikes require reactive provisioning for serving unexpected surges in workloads. When workload can not be predicted, adequate data grouping and placement algorithms may facilitate agile scaling up and down of an infrastructure. In this paper, we analyze a dynamic workload of an on-line music portal and present an elastic Web infrastructure that adapts to workload variations by dynamically scaling up and down servers. The workload is predicted by an autoregressive model capturing trends and seasonal patterns. Further, for enhancing data locality, we propose a predictive data grouping based on the history of content access of a user community. Finally, in order to facilitate agile elasticity, we present a data placement based on workload and access pattern prediction. The experimental results demonstrate that our forecasting model predicts workload with a high precision. Further, the predictive data grouping and placement methods provide high locality, load balance and high utilization of resources, allowing a server infrastructure to scale up and down depending on workload.
  • Keywords
    Internet; autoregressive processes; cloud computing; data handling; file servers; music; portals; social networking (online); 2011; Flickr; Internet platforms; LastFM; QoS requirements; TCO; YouTube; agile elasticity; autoregressive model; cloud-based elastic server infrastructures; data locality enhancement; data spikes; elastic Web infrastructure; online music portal; predictive data grouping; predictive data placement; total cost of ownership reduction; workload variations; Autoregressive processes; Forecasting; Load modeling; Prediction algorithms; Predictive models; Web servers; cloud; data-grouping; elastic; prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2011 11th IEEE/ACM International Symposium on
  • Conference_Location
    Newport Beach, CA
  • Print_ISBN
    978-1-4577-0129-0
  • Electronic_ISBN
    978-0-7695-4395-6
  • Type

    conf

  • DOI
    10.1109/CCGrid.2011.49
  • Filename
    5948619