DocumentCode :
2189229
Title :
Co-Locating Virtual Machine Logging and Replay for Recording System Failures
Author :
Kawasaki, Jin ; Oikawa, Shuichi
Author_Institution :
Dept. of Comput. Sci., Univ. of Tsukuba, Tsukuba, Japan
fYear :
2010
fDate :
June 29 2010-July 1 2010
Firstpage :
1352
Lastpage :
1357
Abstract :
System failures are likely to become more frequent in the near future because of the combination of increased software complexity and a wide variety of usage patterns. It is, however, difficult and sometimes almost impossible to find the root causes of failures from only the limited and unreliable information provided by customers. It is therefore important to provide a feature that enables the complete tracing of system failures. We propose a system that employs two virtual machines, one for the primary execution and the other for the backup execution. The backup virtual machine maintains a past state of the primary virtual machine, together with the log needed to bring the backup to the same state as the primary. When a system failure occurs on the primary virtual machine, the VMM saves the backup state and the log. By replaying the backup virtual machine from the saved state according to the saved log, the execution path to the failure can be completely traced. We developed such a logging and replaying feature in a VMM. It can log and replay the execution of the Linux operating system. The experiments show that the overhead of the primary execution is only fractional, and the overhead of the replaying execution on the backup is less than 2%.
Keywords :
Linux; software metrics; system recovery; virtual machines; Linux operating system; backup execution; backup virtual machine; co-locating virtual machine logging; primary execution; recording system failure; software complexity; system failures; Electric breakdown; Linux; Operating systems; Radiation detectors; Timing; Virtual machining; dependability; operating system; virtual machine monitor;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-7547-6
Type :
conf
DOI :
10.1109/CIT.2010.242
Filename :
5577854