• DocumentCode
    2958732
  • Title

    A Self-tuning Failure Detection Scheme for Cloud Computing Service

  • Author

    Naixue Xiong ; Vasilakos, Athanasios V. ; Jie Wu ; Yang, Y. Richard ; Rindos, Andy ; Yuezhi Zhou ; Wen-Zhan Song ; Yi Pan

  • Author_Institution
    Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    668
  • Lastpage
    679
  • Abstract
    Cloud computing is an increasingly important solution for providing services deployed in dynamically scalable cloud networks. Services in the cloud computing networks may be virtualized with specific servers which host abstracted details. Some of the servers are active and available, while others are busy or heavy loaded, and the remaining are offline for various reasons. Users would expect the right and available servers to complete their application requirements. Therefore, in order to provide an effective control scheme with parameter guidance for cloud resource services, failure detection is essential to meet users´ service expectations. It can resolve possible performance bottlenecks in providing the virtual service for the cloud computing networks. Most existing Failure Detector (FD) schemes do not automatically adjust their detection service parameters for the dynamic network conditions, thus they couldn´t be used for actual application. This paper explores FD properties with relation to the actual and automatic fault-tolerant cloud computing networks, and find a general non-manual analysis method to self-tune the corresponding parameters to satisfy user requirements. Based on this general automatic method, we propose specific and dynamic Self-tuning Failure Detector, called SFD, as a major breakthrough in the existing schemes. We carry out actual and extensive experiments to compare the quality of service performance between the SFD and several other existing FDs. Our experimental results demonstrate that our scheme can automatically adjust SFD control parameters to obtain corresponding services and satisfy user requirements, while maintaining good performance. Such an SFD can be extensively applied to industrial and commercial usage, and it can also significantly benefit the cloud computing networks.
  • Keywords
    cloud computing; software fault tolerance; system recovery; FD; automatic fault-tolerant cloud computing networks; cloud computing networks; cloud computing service; dynamically scalable cloud networks; failure detector; host abstracted details; nonmanual analysis method; self tuning failure detection scheme; virtual service; Cloud computing; Computational modeling; Computer crashes; Delay; Education; Heart beat; Quality of service; Application requirements; Cloud computing service; Fault tolerance; Quality of service; Self-tuning failure detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-0975-2
  • Type

    conf

  • DOI
    10.1109/IPDPS.2012.126
  • Filename
    6267868