DocumentCode
2353820
Title
Phase-based reboot: Reusing operating system execution phases for cheap reboot-based recovery
Author
Yamakita, Kazuya ; Yamada, Hiroshi ; Kono, Kenji
Author_Institution
Keio Univ., Yokohama, Japan
fYear
2011
fDate
27-30 June 2011
Firstpage
169
Lastpage
180
Abstract
Although operating systems (OSes) are crucial to achieving high availability of computer systems, modern OSes are far from bug-free. Rebooting the OS is simple, powerful, and sometimes the only remedy for kernel failures. Once we accept reboot-based recovery as a fact of life, we should try to ensure that the downtime caused by reboots is as short as possible. This paper presents “phase-based” reboots that shorten the downtime caused by reboot-based recovery. The key idea is to divide a boot sequence into phases. The phase-based reboot reuses a system state in the previous boot if the next boot reproduces the same state. A prototype of the phase-based reboot was implemented on Xen 3.4.1 running para-virtualized Linux 2.6.18. Experiments with the prototype show that it successfully recovered from kernel transient failures inserted by a fault injector, and its downtime was 34.3 to 93.6% shorter than that of the normal reboot-based recovery.
Keywords
operating systems (computers); software reusability; Xen 3.4.1; cheap reboot-based recovery; operating system reuse; para-virtualized Linux 2.6.18; phase-based reboot; Hardware; Image restoration; Kernel; Linux; Random access memory; Transient analysis; Operating System Reliability; Reboot-based Recovery; Virtualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems & Networks (DSN), 2011 IEEE/IFIP 41st International Conference on
Conference_Location
Hong Kong
ISSN
1530-0889
Print_ISBN
978-1-4244-9232-9
Electronic_ISBN
1530-0889
Type
conf
DOI
10.1109/DSN.2011.5958216
Filename
5958216
Link To Document