• DocumentCode
    451253
  • Title

    Quantifying and Improving the Availability of High-Performance Cluster-Based Internet Services

  • Author

    Nagaraja, Kiran ; Krishnan, Neeraj ; Bianchini, Ricardo ; Martin, Richard P. ; Nguyen, Thu D.

  • Author_Institution
    Rutgers University
  • fYear
    2003
  • fDate
    15-21 Nov. 2003
  • Firstpage
    27
  • Lastpage
    27
  • Abstract
    Cluster-based servers can substantially increase performance when nodes cooperate to globally manage resources. However, in this paper we show that cooperation results in a substantial availability loss, in the absence of high-availability mechanisms. Specifically, we show that a sophisticated cluster-based Web server, which gains a factor of 3 in performance through cooperation, increases service unavailability by a factor of 10 over a non-cooperative version. We then show how to augment this Web server with software components embodying a small set of high-availability techniques to regain the lost availability. Among other interesting observations, we show that the application of multiple high-availability techniques, each implemented independently in its own subsystem, can lead to inconsistent recovery actions. We also show that a novel technique called Fault Model Enforcement can be used to resolve such inconsistencies. Augmenting the server with these techniques led to a final expected availability of close to 99.99%.
  • Keywords
    Application software; Availability; Computer science; Performance gain; Permission; Resource management; Scalability; Throughput; Web and internet services; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, 2003 ACM/IEEE Conference
  • Print_ISBN
    1-58113-695-1
  • Type

    conf

  • DOI
    10.1109/SC.2003.10046
  • Filename
    1592930