Title :
Bounding rollback-recovery of large distributed computation in WAN environment
Author :
Yang, Jin-Min ; Zhang, Da-Fang
Author_Institution :
Coll. of Software, Hunan Univ., ChangSha, China
Abstract :
In existing optimistic message logging protocols, dependencies must be tracked across the whole system, and all processes are involved in rollback recovery when a failure occurs. For large distributed computations in a WAN environment, where available bandwidth is low and transmission latency is high, the resulting fault-free overhead and recovery overhead are significant, and recovery efficiency decreases as the system scales. This paper introduces a three-layer model of a large distributed system in a WAN environment and presents a proxy-based protocol for message dependency tracking. By using a private proxy to log messages and dependencies, the protocol limits rollback recovery to a scope called a block rather than the entire system, achieving relatively low fault-free overhead and fast output commit, as well as improved recovery efficiency and low recovery overhead.
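The idea of bounding rollback to a block can be illustrated with a minimal sketch. The abstract does not give the protocol's details, so everything below is an assumption made for illustration: the names `Proxy`, `System`, `send`, and `rollback_set` are hypothetical, each block is assumed to have one proxy that logs messages arriving from other blocks, and dependencies are tracked only between processes in the same block. How the actual protocol handles orphan messages that cross block boundaries is not stated in the abstract and is not modeled here.

```python
from collections import defaultdict


class Proxy:
    """Hypothetical per-block proxy that logs messages entering the block."""

    def __init__(self, block_id):
        self.block_id = block_id
        self.log = []  # (sender, receiver, payload), in arrival order

    def record(self, sender, receiver, payload):
        self.log.append((sender, receiver, payload))

    def replay_for(self, receivers):
        # Logged inter-block messages addressed to the given processes.
        return [m for m in self.log if m[1] in receivers]


class System:
    """Illustrative model: processes are grouped into blocks (assumption)."""

    def __init__(self, block_of):
        self.block_of = dict(block_of)  # process name -> block id
        self.proxies = {b: Proxy(b) for b in set(self.block_of.values())}
        # receiver -> set of senders, recorded only within a block
        self.depends_on = defaultdict(set)

    def send(self, sender, receiver, payload):
        if self.block_of[sender] == self.block_of[receiver]:
            # Intra-block message: track the dependency locally.
            self.depends_on[receiver].add(sender)
        else:
            # Inter-block message: the receiving block's proxy logs it.
            self.proxies[self.block_of[receiver]].record(sender, receiver, payload)

    def rollback_set(self, failed):
        # Transitive closure of intra-block dependents of the failed process.
        affected = {failed}
        frontier = [failed]
        while frontier:
            p = frontier.pop()
            for q, senders in self.depends_on.items():
                if p in senders and q not in affected:
                    affected.add(q)
                    frontier.append(q)
        return affected
```

Because dependencies are recorded only for intra-block messages in this sketch, the computed rollback set can never leave the failed process's block; messages that crossed a block boundary remain available from the receiving block's proxy log.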
Keywords :
message passing; protocols; software fault tolerance; system recovery; wide area networks; WAN environment; fault-free overhead; large distributed computation; message dependency tracking protocol; optimistic message logging protocols; private proxy; recovery overhead; rollback-recovery; Bandwidth; Delay; Distributed computing; Internet; Large-scale systems; Local area networks; Message passing; Protocols; Testing; Wide area networks;
Conference_Title :
Test Symposium, 2004. 13th Asian
Print_ISBN :
0-7695-2235-1
DOI :
10.1109/ATS.2004.27