Title :
TinyChecker: Transparent protection of VMs against hypervisor failures with nested virtualization
Author :
Tan, Cheng ; Xia, Yubin ; Chen, Haibo ; Zang, Binyu
Author_Institution :
Inst. of Parallel & Distrib. Syst., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
The increasing amount of resources in a single machine constantly increases the level of server consolidation for virtualization. However, along with the improvement of server efficiency, the dependability of the virtualization layer is not being progressed towards the right direction; instead, the hypervisor level is more vulnerable to diverse failures due to the increasing complexity and scale of the hypervisor layer. This makes tens to hundreds of production VMs in a machine easily risk a single point of failure. This paper tries to mitigate this problem by proposing a technique called TinyChecker, which uses a tiny nested hypervisor to transparently protect guest VMs against failures in the hypervisor layer. TinyChecker is a very small software layer designated for transparent failure detection and recovery, whose reliability can be guaranteed by its small size and possible further formal verification. TinyChecker records all the communication context between VM and hypervisor, protects the critical VM data, detects and recovers the hypervisors among failures. TinyChecker is currently still in an early stage, we report our design consideration and initial evaluation results.
Keywords :
checkpointing; fault tolerant computing; formal verification; software reliability; virtual machines; virtualisation; TinyChecker; VM transparent protection; communication context; critical VM data protection; formal verification; nested virtualization; server consolidation; server efficiency improvement; software layer; tiny-nested hypervisor failures; transparent failure detection; transparent failure recovery; virtual machines; virtualization layer dependability; Computer crashes; Context; Detectors; Registers; Reliability; Software; Virtual machine monitors; Failure detection; Fault tolerance; Hypervisor failure;
Conference_Titel :
Dependable Systems and Networks Workshops (DSN-W), 2012 IEEE/IFIP 42nd International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-2264-5
Electronic_ISBN :
978-1-4673-2265-2
DOI :
10.1109/DSNW.2012.6264691