Title :
Advanced Validation of the DVMS Approach to Fully Distributed VM Scheduling
Author :
Quesnel, Flavien ; Lebre, Adrien ; Pastor, Jorge ; Sudholt, Mario ; Balouek, Daniel
Author_Institution :
ASCOLA Res. Group, Ecole des Mines de Nantes, Nantes, France
Abstract :
The holy grail for Infrastructure as a Service (IaaS) providers is to maximize the utilization of their infrastructure while ensuring the quality of service (QoS) for the virtual machines they host. Although the frameworks in charge of managing virtual machines (VM) on pools of physical ones (PM) have been significantly improved, enabling to manage large-scale infrastructures composed of hundreds of PMs, most of them do not efficiently handle the aforementioned objective. The main reason is that advanced scheduling policies are subject to important and hard scalability problems, that become even worse when VM image transfers have to be considered. In this article, we provide a new validation of the Distributed VM Scheduler approach (DVMS) in a twofold manner. First, we provide a formal proof of the algorithm based on temporal logic. Second, we discuss large-scale evaluations involving up to 4.7K VMs distributed over 467 nodes of the Grid´5000 testbed. As far as we know, these experiments constitute the largest in vivo validation that has been performed so far with decentralized VM schedulers. These results show that a cooperative approach such as ours permits to fix overload problems in a reactive and scalable way.
Keywords :
cloud computing; distributed processing; quality of service; scheduling; temporal logic; theorem proving; virtual machines; DVMS approach; Grid´5000 testbed; IaaS providers; PM; QoS; advanced scheduling policies; cooperative approach; distributed VM scheduler approach; formal proof; hard scalability problems; infrastructure as a service; large-scale infrastructure management; physical ones; quality of service; temporal logic; virtual machines; Nickel; Partitioning algorithms; Peer-to-peer computing; Processor scheduling; Scalability; Schedules; System recovery; autonomous systems; cooperative systems; distributed infrastructures; dynamic scheduling; event-based systems; multi-agent systems; reactivity; scalability; virtualization;
Conference_Titel :
Trust, Security and Privacy in Computing and Communications (TrustCom), 2013 12th IEEE International Conference on
Conference_Location :
Melbourne, VIC
DOI :
10.1109/TrustCom.2013.150