DocumentCode
707977
Title
Permanent fault detection and diagnosis in the lightweight dual modular redundancy architecture
Author
Ferreira, Ronaldo R. ; Sanchez, Ernesto ; da Rolt, Jean ; Nazar, Gabriel L. ; Moreira, Alvaro F. ; Carro, Luigi ; Sonza Reorda, Matteo
Author_Institution
Inst. de Inf., Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
fYear
2015
fDate
25-27 March 2015
Firstpage
1
Lastpage
6
Abstract
The Lightweight Dual Modular Redundancy (LDMR) is a fault-tolerant architecture for low-latency soft error correction. The LDMR introduces a software compilation strategy that enforces error containment inside a basic block, allowing for a simplified error correction policy. This paper evaluates how the LDMR and its architectural components behave in the presence of permanent faults. It also classifies how sensitive the error detection and rollback machinery is to hard faults. By including permanent fault detection and diagnosis, the LDMR becomes a comprehensive fault-tolerant architecture for embedded computing, covering a broad range of fault models. This paper also evaluates the LDMR's performance overhead using a MiBench subset, which is currently 1.54 on average.
Keywords
error correction; error detection; fault diagnosis; radiation hardening (electronics); LDMR architecture; MiBench subset; error containment; error correction policy; error detection; fault tolerant architecture; lightweight dual modular redundancy; low-latency soft error correction; permanent fault detection; permanent fault diagnosis; rollback machinery; software compilation strategy; Circuit faults; Computer architecture; Machinery; Pipelines; Reduced instruction set computing; Registers; Tunneling magnetoresistance; architecture; error detection; fault injection; fault tolerance; modular redundancy; permanent errors;
fLanguage
English
Publisher
IEEE
Conference_Titel
Test Symposium (LATS), 2015 16th Latin-American
Conference_Location
Puerto Vallarta
Type
conf
DOI
10.1109/LATW.2015.7102524
Filename
7102524