Title :
Controlling the Deployment of Virtual Machines on Clusters and Clouds for Scientific Computing in CBRAIN
Author :
Glatard, T. ; Rousseau, Marc-Etienne ; Rioux, Pierre ; Adalat, Reza ; Evans, A.C.
Author_Institution :
McConnell Brain Imaging Centre, McGill Univ., Montreal, QC, Canada
Abstract :
The emergence of hardware virtualization, notably exploited by cloud infrastructures, led to a paradigm shift in distributed computing by enabling complete software customization and elastic scaling of resources. However, new software architectures and deployment algorithms are still required to fully exploit virtualization in web platforms used for scientific computing, commonly called science gateways. We propose a software architecture and an algorithm to enable and optimize the deployment of virtual machines (VMs) on clusters and clouds in science gateways. Our architecture is based on three design principles: (i) separation between resource provisioning and task scheduling, (ii) encapsulation of VMs in regular computing tasks, and (iii) association of a virtual computing site to each disk image. Our algorithm submits and removes VMs on clusters and clouds based on the current system workload, the number of available job slots in active VMs, the cost and current performance of clouds and clusters, and a parameter quantifying the performance-cost trade-off. To cope with variable queuing and booting times, it replicates VMs on independent computing sites, selected by minimizing a linear combination of makespan and cost over the Pareto set of non-dominated solutions. Makespan and cost are estimated from the last measured queuing, booting, and task execution times, using an exponential model of the gain yielded by VM replication. We implement this algorithm in CBRAIN, a science gateway widely used for neuroimaging, and we evaluate it on an infrastructure of 2 clusters and 1 cloud. Results show that it is able to reach some points of the performance-cost trade-off associated with VM deployment.
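The site-selection step described in the abstract can be illustrated with a minimal sketch. It is not the CBRAIN implementation: the `Candidate` type, the function names, and the trade-off weight `alpha` are hypothetical, and the makespan and cost estimates are taken as given inputs (assumed already normalized to comparable scales) rather than derived from the exponential replication model, whose form the abstract does not specify.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Candidate:
    """Hypothetical deployment option with pre-computed estimates."""
    name: str
    makespan: float  # estimated makespan, assumed normalized
    cost: float      # estimated cost, assumed normalized


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only candidates that no other candidate dominates."""
    def is_dominated(c: Candidate) -> bool:
        return any(
            o.makespan <= c.makespan and o.cost <= c.cost
            and (o.makespan < c.makespan or o.cost < c.cost)
            for o in candidates
        )
    return [c for c in candidates if not is_dominated(c)]


def select(candidates: List[Candidate], alpha: float) -> Candidate:
    """Minimize alpha * makespan + (1 - alpha) * cost over the Pareto set.

    alpha is an assumed trade-off weight in [0, 1]:
    1.0 favors performance only, 0.0 favors cost only.
    """
    front = pareto_front(candidates)
    return min(front, key=lambda c: alpha * c.makespan + (1 - alpha) * c.cost)


# Illustrative values only, not measurements from the paper.
options = [
    Candidate("A", makespan=10.0, cost=1.0),
    Candidate("B", makespan=5.0, cost=3.0),
    Candidate("C", makespan=6.0, cost=4.0),  # dominated by B
    Candidate("D", makespan=2.0, cost=8.0),
]
```

With these illustrative values, `select(options, 0.5)` picks option B, while weights of 1.0 and 0.0 pick the fastest (D) and cheapest (A) options respectively.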
Keywords :
Pareto optimisation; cloud computing; distributed algorithms; medical computing; resource allocation; scheduling; virtual machines; virtualisation; workstation clusters; CBRAIN scientific computing; Pareto set; VM deployment; cloud clusters; cloud infrastructures; deployment algorithms; deployment optimization; distributed computing; hardware virtualization; makespan-cost linear combination; neuroimaging; resource provisioning; resource scaling; software customization; task scheduling; virtual computing; virtual machine deployment; Booting; Clouds; Clustering algorithms; Logic gates; Monitoring; Portals; Virtual machining; Cloud; Clusters; Multi-objective optimization; Science gateways; Software architecture; Virtual machines
Conference_Title :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
DOI :
10.1109/CCGrid.2014.42