• DocumentCode
    2563127
  • Title

    Handling catastrophic failures in scalable Internet applications

  • Author

    Haungs, Michael ; Pandey, Raju ; Barr, Earl

  • Author_Institution
    Dept. of Comput. Sci., California Polytech., San Luis Obispo, CA, USA
  • fYear
    2004
  • fDate
    2004
  • Firstpage
    188
  • Lastpage
    194
  • Abstract
    User perceived quality is the most important aspect of Internet applications. After a single negative experience, users tend to switch to one of the other myriad of alternatives available to them on the Internet. Two key components of Internet application quality are scalability and reliability. In this paper, we present the first general-purpose mechanism capable of maintaining reliability in the face of process, machine, and catastrophic failures. We define catastrophic failures as events that cause entire clusters of servers to become unavailable such as network partitioning, router failures, natural disasters, or even terrorist attacks. Our mechanism utilizes client-side tunneling, client-side redirection, and implicit redirection triggers to deliver reliable communication channels. We capitalize on previous work, redirectable sockets (RedSocks), that focuses on Internet application scalability. RedSocks are communication channels enhanced with a novel session layer aimed at modernizing network communication. We modify Red-Socks to create the first fault tolerant socket solution that can handle all server-side failures. Our mechanism is compatible with NATs and firewalls, scalable, application independent, and backwards compatible.
  • Keywords
    Internet; computer network management; computer network reliability; fault tolerant computing; system recovery; telecommunication channels; Internet application quality; Internet application reliability; Internet application scalability; RedSocks; catastrophic failure; client-side redirection; client-side tunneling; communication channels; fault tolerance; machine failure; network communication; network partitioning; process failure; redirectable sockets; router failures; server clusters; server-side failures; Communication channels; Internet; Maintenance; Network servers; Scalability; Sockets; Switches; Terrorism; Tunneling; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications and the Internet, 2004. Proceedings. 2004 International Symposium on
  • Print_ISBN
    0-7695-2068-5
  • Type

    conf

  • DOI
    10.1109/SAINT.2004.1266115
  • Filename
    1266115