DocumentCode :
2496829
Title :
Specialized N-modular redundant processors in large-scale distributed systems
Author :
Yen, I-Ling
Author_Institution :
Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
fYear :
1996
fDate :
23-25 Oct 1996
Firstpage :
12
Lastpage :
21
Abstract :
Computers are being used to achieve increasingly sophisticated control for large and complex systems. Many of these systems require a large shared state-space or database. Thus, handling real-time concurrent accesses to a shared database is an essential feature for modern fault-tolerant systems. Many fault-tolerant systems have been implemented for uniformly tolerating various types of failures, such as MAFT (Multicomputer Architecture for Fault Tolerance), FTP (Fault-Tolerant Processor), FTPP (Fault-Tolerant Parallel Processors) and Delta-4. However, most of these either lack the notion of a shared state-space or do not efficiently support parallel tasks that concurrently access a shared state-space. We use a processor-specialization approach to increase the effectiveness of replication and, consequently, achieve cost-effective fault tolerance in such systems. The SNMR (specialized N-modular redundancy) protocol has been developed based on these concepts. Compared to many existing Byzantine-resilient systems, the SNMR approach incurs less overhead and can be easily parameterized to fit various fault models
Keywords :
access protocols; computerised control; concurrency control; database machines; distributed databases; large-scale systems; real-time systems; redundancy; replicated databases; software fault tolerance; state-space methods; very large databases; Byzantine-resilient systems; SNMR protocol; complex systems control; cost-effective fault tolerance; fail-stop failures; fault-tolerant systems; large shared database; large-scale distributed systems; malicious failures; overhead; parallel tasks; parameterizability; processor specialization; real-time concurrent accesses; replication effectiveness; shared state-space; specialized N-modular redundant processors; Access protocols; Concurrent computing; Control systems; Distributed computing; Electromagnetic radiation; Fault tolerance; Fault tolerant systems; Large-scale systems; Spatial databases; State-space methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reliable Distributed Systems, 1996. Proceedings., 15th Symposium on
Conference_Location :
Nigara-on-the-Lake, Ont.
ISSN :
1060-9857
Print_ISBN :
0-8186-7481-4
Type :
conf
DOI :
10.1109/RELDIS.1996.559688
Filename :
559688
Link To Document :
بازگشت