DocumentCode
3103559
Title
A Three-Phases Byzantine Fault Tolerance Mechanism for HLA-Based Simulation
Author
Li, Zengxiang ; Cai, Wentong ; Turner, Stephen John ; Pan, Ke
Author_Institution
Parallel & Distrib. Comput. Center, Nanyang Technol. Univ., Singapore, Singapore
fYear
2010
fDate
17-20 Oct. 2010
Firstpage
149
Lastpage
158
Abstract
A large-scale HLA-based simulation (federation) is composed of a large number of simulation components (federates), which may be developed by different participants and executed at different locations. Byzantine failures, caused by malicious attacks or software/hardware bugs, may occur in federates and propagate through the federation execution. In this paper, a three-phase Byzantine Fault Tolerance (BFT) mechanism (failure detection, failure location, and failure recovery) is proposed based on the decoupled federate architecture. By combining replication, checkpointing, and message logging techniques, some redundant executions of federate replicas are avoided. The BFT mechanism is implemented using both Barrier and No-Barrier federate replication structures. Protocols are also developed to remove the epidemic effect caused by Byzantine failures. Experimental results show that the BFT mechanism using No-Barrier replication significantly outperforms the one using Barrier replication when federate replicas have different runtime performance.
Keywords
checkpointing; message passing; program debugging; software architecture; software fault tolerance; BFT mechanism; Byzantine failure; Byzantine fault tolerance; barrier replication; check pointing; decoupled federate architecture; epidemic effect; failure detection; failure location; failure recovery; federation execution; hardware bug; large scale HLA-based simulation; malicious attack; message logging; no-barrier federate replication structure; protocol; software bug; Biological system modeling; Computational modeling; Computer architecture; Computer crashes; Fault tolerance; Fault tolerant systems; Protocols; HLA-based simulation; federate replication
fLanguage
English
Publisher
IEEE
Conference_Titel
Distributed Simulation and Real Time Applications (DS-RT), 2010 IEEE/ACM 14th International Symposium on
Conference_Location
Fairfax, VA
ISSN
1550-6525
Print_ISBN
978-1-4244-8651-9
Type
conf
DOI
10.1109/DS-RT.2010.24
Filename
5636701