Author_Institution :
Microelectron. Center, Harbin Inst. of Technol., Harbin, China
Abstract :
Transient multiple cell upsets (MCUs) are becoming major issues in the reliability of memories exposed to radiation environment. To prevent MCUs from causing data corruption, more complex error correction codes (ECCs) are widely used to protect memory, but the main problem is that they would require higher delay overhead. Recently, matrix codes (MCs) based on Hamming codes have been proposed for memory protection. The main issue is that they are double error correction codes and the error correction capabilities are not improved in all cases. In this paper, novel decimal matrix code (DMC) based on divide-symbol is proposed to enhance memory reliability with lower delay overhead. The proposed DMC utilizes decimal algorithm to obtain the maximum error detection capability. Moreover, the encoder-reuse technique (ERT) is proposed to minimize the area overhead of extra circuits without disturbing the whole encoding and decoding processes. ERT uses DMC encoder itself to be part of the decoder. The proposed DMC is compared to well-known codes such as the existing Hamming, MCs, and punctured difference set (PDS) codes. The obtained results show that the mean time to failure (MTTF) of the proposed scheme is 452.9%, 154.6%, and 122.6% of Hamming, MC, and PDS, respectively. At the same time, the delay overhead of the proposed scheme is 73.1%, 69.0%, and 26.2% of Hamming, MC, and PDS, respectively. The only drawback to the proposed scheme is that it requires more redundant bits for memory protection.
Keywords :
error correction codes; integrated circuit reliability; integrated memory circuits; radiation hardening (electronics); DMC; DMC encoder; ECC; Hamming codes; MC; PDS; area overhead minimization; decimal algorithm; decimal matrix code; delay overhead; divide-symbol; double-error correction codes; encoder-reuse technique; encoding-decoding process; enhanced memory reliability; error correction capability; maximum error detection capability; mean time-to-failure; memory protection; punctured difference set code; radiation environment; transient MCU; transient multiple-cell upsets; Decimal algorithm; error correction codes (ECCs); mean time to failure (MTTF); memory; multiple cells upsets (MCUs);