• DocumentCode
    2353820
  • Title

    Phase-based reboot: Reusing operating system execution phases for cheap reboot-based recovery

  • Author

    Yamakita, Kazuya ; Yamada, Hiroshi ; Kono, Kenji

  • Author_Institution
    Keio Univ., Yokohama, Japan
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    169
  • Lastpage
    180
  • Abstract
    Although operating systems (OSes) are crucial to achieving high availability of computer systems, modern OSes are far from bug-free. Rebooting the OS is simple, powerful, and sometimes the only remedy for kernel failures. Once we accept reboot-based recovery as a fact of life, we should try to ensure that the downtime caused by reboots is as short as possible. This paper presents “phase-based” reboots that shorten the downtime caused by reboot-based recovery. The key idea is to divide a boot sequence into phases. The phase-based reboot reuses a system state in the previous boot if the next boot reproduces the same state. A prototype of the phase-based reboot was implemented on Xen 3.4.1 running para-virtualized Linux 2.6.18. Experiments with the prototype show that it successfully recovered from kernel transient failures inserted by a fault injector, and its downtime was 34.3 to 93.6% shorter than that of the normal reboot-based recovery.
  • Keywords
    operating systems (computers); software reusability; Xen 3.4.1; cheap reboot-based recovery; operating system reuse; para-virtualized Linux 2.6.18; phase-based reboot; Hardware; Image restoration; Kernel; Linux; Random access memory; Transient analysis; Operating System Reliability; Reboot-based Recovery; Virtualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems & Networks (DSN), 2011 IEEE/IFIP 41st International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1530-0889
  • Print_ISBN
    978-1-4244-9232-9
  • Electronic_ISBN
    1530-0889
  • Type

    conf

  • DOI
    10.1109/DSN.2011.5958216
  • Filename
    5958216