Title :
Recovering from multiple process failures in the time warp mechanism
Author :
Agrawal, Divyakant ; Agre, Jonathan R.
Author_Institution :
Dept. of Comput. Sci., California Univ., Santa Barbara, CA, USA
Date :
12/1/1992
Abstract :
A recovery protocol for distributed systems using the time warp control mechanism is described. The proposed protocol tolerates multiple process failures. Time warp is an optimistic execution technique in which synchronization is achieved using rollback. The recovery protocol exploits the redundancy that time warp already maintains to implement process rollback. Thus, the protocol adds little bookkeeping overhead, in contrast to many other recovery protocols.
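To make the rollback idea in the abstract concrete, the following is a minimal sketch of generic time warp rollback in a single process. It is not the recovery protocol of the paper. The class `LogicalProcess`, its fields, and the `receive` method are hypothetical names chosen for illustration, the process state is assumed to be a simple running sum, and anti-messages for already-sent output are omitted. The saved snapshots and the input log stand in for the kind of redundancy the abstract refers to.

```python
from dataclasses import dataclass, field


@dataclass
class LogicalProcess:
    """Toy time warp process (hypothetical); its state is a running sum of event values."""
    lvt: int = 0                                    # local virtual time
    state: int = 0                                  # assumed application state
    processed: list = field(default_factory=list)   # input log of (timestamp, value)
    snapshots: list = field(default_factory=list)   # (lvt, state) saved before each event

    def _execute(self, ts, value):
        # Save the state first so this event can be undone later.
        self.snapshots.append((self.lvt, self.state))
        self.state += value
        self.lvt = ts
        self.processed.append((ts, value))

    def receive(self, ts, value):
        """Process an event optimistically; return how many events were rolled back."""
        if ts >= self.lvt:
            self._execute(ts, value)
            return 0
        # Straggler: undo every processed event with a later timestamp.
        redo = []
        while self.processed and self.processed[-1][0] > ts:
            redo.append(self.processed.pop())
            self.lvt, self.state = self.snapshots.pop()
        # Execute the straggler, then re-execute the undone events in timestamp order.
        self._execute(ts, value)
        for event in reversed(redo):
            self._execute(*event)
        return len(redo)
```

In this sketch an event that arrives with a timestamp below the local virtual time forces the process back to the last saved state that precedes it, after which the logged events are replayed in timestamp order.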
Keywords :
distributed algorithms; distributed processing; fault tolerant computing; protocols; distributed systems; fault tolerant; multiple process failures; optimistic execution; process rollback; recovery protocol; redundancy; synchronization; time warp control mechanism; Computational modeling; Concurrency control; Control systems; Discrete event simulation; Fault tolerance; Parallel processing; Transaction databases
Journal_Title :
Computers, IEEE Transactions on