Title :
Permanent fault detection and diagnosis in the lightweight dual modular redundancy architecture
Author :
Ferreira, Ronaldo R. ; Sanchez, Ernesto ; da Rolt, Jean ; Nazar, Gabrie L. ; Moreira, Alvaro F. ; Carro, Luigi ; Sonza Reorda, Matteo
Author_Institution :
Inst. de Inf., Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
Abstract :
The Lightweight Dual Modular Redundancy (LDMR) is a fault tolerant architecture for low-latency soft error correction. The LDMR introduces a software compilation strategy that enforces error containment inside a basic block, allowing for a simplified error correction policy. This paper evaluates how the LDMR and its architectural components behave in the presence of permanent faults. It also classifies how sensitive the error detection and rollback machinery is to hard faults. By including permanent fault detection and diagnosis, the LDMR becomes a comprehensive fault tolerant architecture for embedded computing, covering a broad range of fault models. This paper also evaluates the LDMR´s performance overhead using a MiBench subset, which is currently 1.54 in average.
Keywords :
error correction; error detection; fault diagnosis; radiation hardening (electronics); LDMR architecture; MiBench subset; error containment; error correction policy; error detection; fault tolerant architecture; lightweight dual modular redundancy; low-latency soft error correction; permanent fault detection; permanent fault diagnosis; rollback machinery; software compilation strategy; Circuit faults; Computer architecture; Machinery; Pipelines; Reduced instruction set computing; Registers; Tunneling magnetoresistance; architecture; error detection; fault injection; fault tolerance; modular redundancy; permanent errors;
Conference_Titel :
Test Symposium (LATS), 2015 16th Latin-American
Conference_Location :
Puerto Vallarta
DOI :
10.1109/LATW.2015.7102524