• DocumentCode
    3244358
  • Title

    Evaluating the impact of communication architecture on the performability of cluster-based services

  • Author

    Nagaraja, Kiran ; Krishnan, Neeraj ; Bianchini, Ricardo ; Martin, Richard P. ; Nguyen, Thu D.

  • Author_Institution
    Dept. of Comput. Sci., Rutgers Univ., USA
  • fYear
    2003
  • fDate
    8-12 Feb. 2003
  • Firstpage
    229
  • Lastpage
    240
  • Abstract
    We consider the impact of different communication architectures on the performability (performance plus availability) of cluster-based servers. In particular, we use a combination of fault-injection experiments and analytic modeling to evaluate the performability of two popular communication protocols, TCP and VIA, as the intra-cluster communication substrate of a sophisticated Web server. Our analysis leads to several interesting conclusions, the most surprising of which is that, under the same fault load, VIA-based servers deliver greater availability than TCP-based servers. If we assume higher fault rates for VIA-based servers because the underlying technology is more immature and the programming model is more complex, we find that packet errors or application faults would have to occur at approximately 4 times the rate in TCP-based servers before their performabilities equalize. We use the results of the study to suggest that high-performance, robust communication layers for highly available cluster-based servers should preserve message boundaries (as opposed to using byte streams), use single-copy transfers, pre-allocate channel resources, and report errors in a manner consistent with the network fabric's fault model.
  • Keywords
    Internet; computer network reliability; fault tolerant computing; performance evaluation; transport protocols; workstation clusters; TCP; VIA; Web server; analytic modeling; availability; channel resource pre-allocation; cluster-based servers; cluster-based services; communication architecture; communication protocols; error reporting; fault rates; fault-injection experiments; intra-cluster communication substrate; message boundaries; packet errors; performability; performance; single-copy transfers; Availability; Computer architecture; Computer science; Network servers; Operating systems; Performance analysis; Performance evaluation; Protocols; Robustness; Web server;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High-Performance Computer Architecture, 2003. HPCA-9 2003. Proceedings. The Ninth International Symposium on
  • ISSN
    1530-0897
  • Print_ISBN
    0-7695-1871-0
  • Type

    conf

  • DOI
    10.1109/HPCA.2003.1183541
  • Filename
    1183541