• DocumentCode
    3412421
  • Title

    Fault tolerant objects in distributed systems using hot replication

  • Author

    Beedubail, Ganesha ; Karmarkar, Anish ; Gurijala, Anil ; Marti, Willis ; Pooch, Udo

  • Author_Institution
    Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
  • fYear
    1996
  • fDate
    27-29 Mar 1996
  • Firstpage
    89
  • Lastpage
    95
  • Abstract
    This paper presents a new algorithm for supporting fault tolerant objects in distributed systems. The fault tolerance provided by the algorithm is fully user transparent. The algorithm uses a variation of object replication scheme, which we call the Hot Replication Scheme. The algorithm supports nested object invocations. The chief advantages of the scheme are: a) No action is needed in the case of failure of a secondary replica, b) The time to recover from a primary failure is minimal, c) Separation of replication protocol and reliable communication protocol. To recover from a primary failure the system need to (detect the failure and) select one of the secondaries to become the primary. The designated secondary can become primary once it has made sure that its current state is equivalent to the state of the failed primary (it can do so by processing outstanding requests, if any). This is in contrast with the checkpointing and rollback recovery scheme, where the recovery time can be substantial. Our algorithm exploits the general features and concepts associated with the notion of the objects and object interactions to its advantage
  • Keywords
    distributed processing; object-oriented programming; software fault tolerance; system recovery; checkpointing; distributed systems; fault tolerant objects; hot replication; hot replication scheme; nested object invocations; object interactions; primary failure; reliable communication protocol; replication protocol; rollback recovery scheme; Checkpointing; Computer science; Design methodology; Distributed computing; Fault tolerance; Fault tolerant systems; Message passing; Object oriented modeling; Physics computing; Protocols;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 1996., Conference Proceedings of the 1996 IEEE Fifteenth Annual International Phoenix Conference on
  • Conference_Location
    Scottsdale, AZ
  • Print_ISBN
    0-7803-3255-5
  • Type

    conf

  • DOI
    10.1109/PCCC.1996.493618
  • Filename
    493618