DocumentCode :
244269
Title :
Failure Analysis of Virtual and Physical Machines: Patterns, Causes and Characteristics
Author :
Birke, Robert ; Giurgiu, Ioana ; Chen, L.Y. ; Wiesmann, Dorothea ; Engbersen, T.
Author_Institution :
IBM Res. Zurich Lab., Zurich, Switzerland
fYear :
2014
fDate :
23-26 June 2014
Firstpage :
1
Lastpage :
12
Abstract :
In today´s commercial data centers, the computation density grows continuously as the number of hardware components and workloads in units of virtual machines increase. The service availability guaranteed by data centers heavily depends on the reliability of the physical and virtual servers. In this study, we conduct an analysis on 10K virtual and physical machines hosted on five commercial data centers over an observation period of one year. Our objective is to establish a sound understanding of the differences and similarities between failures of physical and virtual machines. We first capture their failure patterns, i.e., the failure rates, the distributions of times between failures and of repair times, as well as, the time and space dependency of failures. Moreover, we correlate failures with the resource capacity and run-time usage to identify the characteristics of failing servers. Finally, we discuss how virtual machine management actions, i.e., consolidation and on/off frequency, impact virtual machine failures.
Keywords :
failure analysis; virtual machines; commercial data centers; failure analysis; physical machines; resource capacity; virtual machine management; virtual machines; virtual servers; Computer crashes; Correlation; Hardware; Maintenance engineering; Reliability; Servers; Software; Datacenters; VM failures; failure root causes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks (DSN), 2014 44th Annual IEEE/IFIP International Conference on
Conference_Location :
Atlanta, GA
Type :
conf
DOI :
10.1109/DSN.2014.18
Filename :
6903562
Link To Document :
بازگشت