Title :
Reducing embedded software radiation-induced failures through cache memories
Author :
Santini, Thomas ; Rech, P. ; Nazar, Gabriel ; Carro, Luigi ; Rech Wagner, Flavio
Author_Institution :
Inst. of Inf., Fed. Univ. of Rio Grande do Sul, Porto Alegre, Brazil
Abstract :
Cache memories are traditionally disabled in space-level and safety-critical applications, since it was believed that the sensitive area they introduce would compromise the system reliability. As technology has evolved, the speed gap between logic and main memory has increased in such a way that disabling caches slows the code much more than in the past. As a result, the processor is exposed for a much longer time in order to compute the same workload. In this paper we demonstrate that, on modern embedded processors, enabling caches may bring benefits to critical systems: the larger exposed area may be compensated by the shorter exposure time, leading to an overall improved reliability. We describe the Mean Workload Between Failures, an intuitive metric to evaluate the impact of enabling caches for a given generic application error rate. The proposed metric is experimentally validated through an extensive radiation test campaign using a 28 nm off-the-shelf ARM-based SoC as a case study. The failure probability of the bare-metal application is decreased when the L1 cache is enabled but increased when L2 is also enabled. We also discuss when L2 caches could make the device more reliable.
Keywords :
cache storage; embedded systems; integrated circuit reliability; probability; radiation hardening (electronics); system-on-chip; L1 cache; L2 cache; bare-metal application; cache memories; critical systems; disabling caches; embedded software radiation-induced failures; failure probability; generic application error rate; logic; main memory; mean-workload-between-failures; modern embedded processors; off-the-shelf ARM-based SoC; radiation test campaign; safety-critical applications; space-level applications; system reliability; Cache memory; Error correction codes; Neutrons; Performance evaluation; Reliability; Sensitivity;
Conference_Titel :
Test Symposium (ETS), 2014 19th IEEE European
Conference_Location :
Paderborn
DOI :
10.1109/ETS.2014.6847793