Title :
Predictive Data Grouping and Placement for Cloud-Based Elastic Server Infrastructures
Author :
Tirado, Juan M. ; Higuero, Daniel ; Isaila, Florin ; Carretero, Jesús
Author_Institution :
Comput. Sci. & Eng. Dept., Carlos III Univ. of Madrid, Leganes, Spain
Abstract :
Workload variations on Internet platforms such as YouTube, Flickr, LastFM require novel approaches to dynamic resource provisioning in order to meet QoS requirements, while reducing the Total Cost of Ownership (TCO) of the infrastructures. The economy of scale promise of cloud computing is a great opportunity to approach this problem, by developing elastic large scale server infrastructures. However, a proactive approach to dynamic resource provisioning requires prediction models forecasting future load patterns. On the other hand, unexpected volume and data spikes require reactive provisioning for serving unexpected surges in workloads. When workload can not be predicted, adequate data grouping and placement algorithms may facilitate agile scaling up and down of an infrastructure. In this paper, we analyze a dynamic workload of an on-line music portal and present an elastic Web infrastructure that adapts to workload variations by dynamically scaling up and down servers. The workload is predicted by an autoregressive model capturing trends and seasonal patterns. Further, for enhancing data locality, we propose a predictive data grouping based on the history of content access of a user community. Finally, in order to facilitate agile elasticity, we present a data placement based on workload and access pattern prediction. The experimental results demonstrate that our forecasting model predicts workload with a high precision. Further, the predictive data grouping and placement methods provide high locality, load balance and high utilization of resources, allowing a server infrastructure to scale up and down depending on workload.
Keywords :
Internet; autoregressive processes; cloud computing; data handling; file servers; music; portals; social networking (online); 2011; Flickr; Internet platforms; LastFM; QoS requirements; TCO; YouTube; agile elasticity; autoregressive model; cloud-based elastic server infrastructures; data locality enhancement; data spikes; elastic Web infrastructure; online music portal; predictive data grouping; predictive data placement; total cost of ownership reduction; workload variations; Autoregressive processes; Forecasting; Load modeling; Prediction algorithms; Predictive models; Web servers; cloud; data-grouping; elastic; prediction;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2011 11th IEEE/ACM International Symposium on
Conference_Location :
Newport Beach, CA
Print_ISBN :
978-1-4577-0129-0
Electronic_ISBN :
978-0-7695-4395-6
DOI :
10.1109/CCGrid.2011.49