• DocumentCode
    2577128
  • Title

    CDA: A Cloud Dependability Analysis Framework for Characterizing System Dependability in Cloud Computing Infrastructures

  • Author

    Guan, Qiang ; Chiu, Chi-Chen ; Fu, Song

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of North Texas, Denton, TX, USA
  • fYear
    2012
  • fDate
    18-19 Nov. 2012
  • Firstpage
    11
  • Lastpage
    20
  • Abstract
    Cloud computing has become increasingly popular by obviating the need for users to own and maintain complex computing infrastructure. However, due to their inherent complexity and large scale, production cloud computing systems are prone to various runtime problems caused by hardware and software failures. Dependability assurance is crucial for building sustainable cloud computing services. Although many techniques have been proposed to analyze and enhance reliability of distributed systems, there is little work on understanding the dependability of cloud computing environments. As virtualization has been an enabling technology for the cloud, it is imperative to investigate the impact of virtualization on the cloud dependability, which is the focus of this work. In this paper, we present a cloud dependability analysis (CDA) framework with mechanisms to characterize failure behavior in cloud computing infrastructures. We design the failure-metric DAGs (directed a cyclic graph) to analyze the correlation of various performance metrics with failure events in virtualized and non-virtualized systems. We study multiple types of failures. By comparing the generated DAGs in the two environments, we gain insight into the impact of virtualization on the cloud dependability. This paper is the first attempt to study this crucial issue. In addition, we exploit the identified metrics for failure detection. Experimental results from an on-campus cloud computing test bed show that our approach can achieve high detection accuracy while using a small number of performance metrics.
  • Keywords
    cloud computing; directed graphs; software fault tolerance; software reliability; system recovery; virtualisation; CDA; cloud computing environments; cloud computing infrastructures; cloud dependability analysis framework; complex computing infrastructure; dependability assurance; directed acyclic graph; distributed system reliability; failure detection; failure-metric DAG; hardware failures; nonvirtualized systems; on-campus cloud computing testbed; production cloud computing systems; software failures; sustainable cloud computing services; system dependability characterization; virtualization; Cloud computing; Correlation; Measurement; Servers; Virtual machine monitors; Virtual machining; Cloud Computing; Dependability; Evaluation; Failure Management; Virtualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Computing (PRDC), 2012 IEEE 18th Pacific Rim International Symposium on
  • Conference_Location
    Niigata
  • Print_ISBN
    978-1-4673-4849-2
  • Electronic_ISBN
    978-0-7695-4885-2
  • Type

    conf

  • DOI
    10.1109/PRDC.2012.10
  • Filename
    6385066