DocumentCode
422726
Title
Bounding rollback-recovery of large distributed computation in WAN environment
Author
Yang, Jin-Min ; Zhang, Da-Fang
Author_Institution
Coll. of Software, Hunan Univ., ChangSha, China
fYear
2004
fDate
15-17 Nov. 2004
Firstpage
394
Lastpage
399
Abstract
In the existing optimistic message logging protocols, the dependency must be tracked in whole system, and all processes are involved in rollback recovery in the event of failure. For large distributed computation in WAN environment with the low available bandwidth and high transmission latency, its fault-free overhead and recovery overhead are outstanding, recovery efficiency decreasing with the scale of system. This paper introduces a three-layer model of large distributed system in WAN environment, and presents a protocol of message dependency tracking based on proxy. Utilizing private proxy to log messages and dependencies, the protocol limits rollback-recovery to a scope called block rather than the entire system, achieving relative low fault-free overhead and fast output commit, as well as improved recovery efficiency and low recovery overhead.
Keywords
message passing; protocols; software fault tolerance; system recovery; wide area networks; WAN environment; fault-free overhead; large distributed computation; message dependency tracking protocol; optimistic message logging protocols; private proxy; recovery overhead; rollback-recovery; Bandwidth; Delay; Distributed computing; Internet; Large-scale systems; Local area networks; Message passing; Protocols; Testing; Wide area networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Test Symposium, 2004. 13th Asian
ISSN
1081-7735
Print_ISBN
0-7695-2235-1
Type
conf
DOI
10.1109/ATS.2004.27
Filename
1376590
Link To Document