DocumentCode :
3103559
Title :
A Three-Phases Byzantine Fault Tolerance Mechanism for HLA-Based Simulation
Author :
Li, Zengxiang ; Cai, Wentong ; Turner, Stephen John ; Pan, Ke
Author_Institution :
Parallel & Distrib. Comput. Center, Nanyang Technol. Univ., Singapore, Singapore
fYear :
2010
fDate :
17-20 Oct. 2010
Firstpage :
149
Lastpage :
158
Abstract :
A large scale HLA-based simulation (federation) is composed of a large number of simulation components (federates), which may be developed by different participants and executed at different locations. Byzantine failures, caused by malicious attacks and software/hardware bugs, might happen to federates and propagate in the federation execution. In this paper, a three-phases (i.e., failure detection, failure location, and failure recovery) Byzantine Fault Tolerance (BFT) mechanism is proposed based on the decoupled federate architecture. By combining the replication, check pointing and message logging techniques, some redundant executions of federate replicas are avoided. The BFT mechanism is implemented using both Barrier and No-Barrier federate replication structures. Protocols are also developed to remove the epidemic effect caused by Byzantine failures. As the experiment results show, the BFT mechanism using No-Barrier replication outperforms that using Barrier replication significantly in the case that federate replicas have different runtime performance.
Keywords :
checkpointing; message passing; program debugging; software architecture; software fault tolerance; BFT mechanism; Byzantine failure; Byzantine fault tolerance; barrier replication; check pointing; decoupled federate architecture; epidemic effect; failure detection; failure location; failure recovery; federation execution; hardware bug; large scale HLA-based simulation; malicious attack; message logging; no-barrier federate replication structure; protocol; software bug; Biological system modeling; Computational modeling; Computer architecture; Computer crashes; Fault tolerance; Fault tolerant systems; Protocols; Byzantine fault tolerance; HLA-based simulation; checkpointing; decoupled federate architecture; federate replication; message logging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Simulation and Real Time Applications (DS-RT), 2010 IEEE/ACM 14th International Symposium on
Conference_Location :
Fairfax, VA
ISSN :
1550-6525
Print_ISBN :
978-1-4244-8651-9
Type :
conf
DOI :
10.1109/DS-RT.2010.24
Filename :
5636701
Link To Document :
بازگشت