Title :
Improving scratch-pad memory reliability through compiler-guided data block duplication
Author :
Li, F. ; Chen, G. ; Kandemir, M. ; Kolcu, I.
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
Recent trends in embedded computing indicates an increasing use of scratch-pad memories (SPMs) as on-chip store for instructions and data. An important characteristic of these memory components is that they are managed by software, instead of hardware. Ever-scaling process technology and employment of several power-saving techniques in embedded systems (e.g., voltage scaling) make these systems particularly vulnerable to soft errors and other transient errors. Therefore, it is very important in practice to consider the impact of soft errors in SPMs. While it is possible to employ classical memory protection mechanisms such as parity checks and ECC, each of these has its drawbacks. Specifically, a pure parity-based protection cannot correct any errors, and ECCs can be an overkill in the normal operation state when no soft error is experienced. This paper proposes an alternate approach to protect SPMs against soft errors. The proposed approach is based on data block duplication under compiler control. More specifically, an optimizing compiler duplicates data blocks within the SPM and protects each data block by parity if such a duplication does not hurt performance. The goal of this scheme is to provide only parity protection for data blocks (and reduce the overheads at runtime when no error occurs) but correct errors using the duplicate (when an error occurs in the primary copy), provided that the duplicate is not corrupted.
Keywords :
cache storage; embedded systems; memory architecture; microprocessor chips; program compilers; ECC; compiler-guided data block duplication; embedded computing; embedded systems; memory components; memory protection mechanisms; on-chip storage; parity checks; parity-based protection; scratch-pad memory reliability; soft errors; transient error; voltage scaling; Embedded computing; Embedded system; Employment; Error correction; Hardware; Memory management; Power system management; Protection; Scanning probe microscopy; Voltage;
Conference_Titel :
Computer-Aided Design, 2005. ICCAD-2005. IEEE/ACM International Conference on
Print_ISBN :
0-7803-9254-X
DOI :
10.1109/ICCAD.2005.1560208