DocumentCode :
3426226
Title :
Research on architecture and design principles of COTS components based generic fault-tolerant computer
Author :
Zhonghong, Ou ; Youguang, Yuan ; Xiaoyong, Zhao
Author_Institution :
Coll. of Comput. Sci., Harbin Eng. Univ., China
fYear :
2005
fDate :
12-14 Dec. 2005
Abstract :
A novel fault-tolerant architecture based on COTS components is put forward and implemented in this paper. In order to make observable the internal states of COTS components, and in order to concurrently perform fault-tolerance function and normal function and control the behavior of each COTS component, the authors have devised an intelligent hardware module dedicated to fault-tolerance processing, which can significantly offload application processors. This architecture digs every inherent fault-detection mechanism and adopts layered fault protection mechanism to raise fault-tolerance coverage. This architecture is efficient, flexible, scalable and transparent with respect to fault-tolerance. It is Byzantine fault safe and also supports online repair. The authors also raise some design tradeoffs when designing COTS components based fault-tolerant computer.
Keywords :
computer architecture; fault tolerant computing; Byzantine fault safety; COTS components based generic fault-tolerant computer; fault detection; fault-tolerant architecture; intelligent hardware module; layered fault protection; online repair; Application software; Computer architecture; Computer errors; Costs; Delay; Fault tolerance; Hardware; Partial response channels; Protection; Time sharing computer systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Computing, 2005. Proceedings. 11th Pacific Rim International Symposium on
Print_ISBN :
0-7695-2492-3
Type :
conf
DOI :
10.1109/PRDC.2005.53
Filename :
1607519
Link To Document :
بازگشت