Title :
CDA: A Cloud Dependability Analysis Framework for Characterizing System Dependability in Cloud Computing Infrastructures
Author :
Guan, Qiang ; Chiu, Chi-Chen ; Fu, Song
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of North Texas, Denton, TX, USA
Abstract :
Cloud computing has become increasingly popular by obviating the need for users to own and maintain complex computing infrastructure. However, due to their inherent complexity and large scale, production cloud computing systems are prone to various runtime problems caused by hardware and software failures. Dependability assurance is crucial for building sustainable cloud computing services. Although many techniques have been proposed to analyze and enhance the reliability of distributed systems, there is little work on understanding the dependability of cloud computing environments. As virtualization has been an enabling technology for the cloud, it is imperative to investigate the impact of virtualization on cloud dependability, which is the focus of this work. In this paper, we present a cloud dependability analysis (CDA) framework with mechanisms to characterize failure behavior in cloud computing infrastructures. We design failure-metric DAGs (directed acyclic graphs) to analyze the correlation of various performance metrics with failure events in virtualized and non-virtualized systems. We study multiple types of failures. By comparing the generated DAGs in the two environments, we gain insight into the impact of virtualization on cloud dependability. This paper is the first attempt to study this crucial issue. In addition, we exploit the identified metrics for failure detection. Experimental results from an on-campus cloud computing testbed show that our approach can achieve high detection accuracy while using a small number of performance metrics.
Keywords :
cloud computing; directed graphs; software fault tolerance; software reliability; system recovery; virtualisation; CDA; cloud computing environments; cloud computing infrastructures; cloud dependability analysis framework; complex computing infrastructure; dependability assurance; directed acyclic graph; distributed system reliability; failure detection; failure-metric DAG; hardware failures; nonvirtualized systems; on-campus cloud computing testbed; production cloud computing systems; software failures; sustainable cloud computing services; system dependability characterization; virtualization; Cloud computing; Correlation; Measurement; Servers; Virtual machine monitors; Virtual machining; Cloud Computing; Dependability; Evaluation; Failure Management; Virtualization;
Conference_Title :
Dependable Computing (PRDC), 2012 IEEE 18th Pacific Rim International Symposium on
Conference_Location :
Niigata
Print_ISBN :
978-1-4673-4849-2
Electronic_ISBN :
978-0-7695-4885-2
DOI :
10.1109/PRDC.2012.10