Title :
CIDR: A Cache Inspired Area-Efficient DRAM Resilience Architecture against Permanent Faults
Author :
Seongil, O. ; Sanghyuk Kwon ; Young Hoon Son ; Yujin Park ; Jung Ho Ahn
Author_Institution :
Dept. of Transdisciplinary Studies, Seoul Nat. Univ., Seoul, South Korea
Abstract :
Faulty cells have become major problems in cost-sensitive main-memory DRAM devices. Conventional solutions to reduce device failure rates due to cells with permanent faults, such as populating spare rows and relying on errorcorrecting codes, have had limited success due to high area overheads. In this paper, we propose CIDR, a novel cache-inspired DRAM resilience architecture, which substantially reduces the area overhead of handling bit errors from these faulty cells. A DRAM device adopting CIDR has a small cache next to its I/O pads to replace accesses to the addresses that include the faulty cells with ones that correspond to the cache data array. We minimize the energy overhead of accessing the cache tags for every read or write by adding a Bloom filter in front of the cache. The augmented cache is programmed once during the testing phase and is out of the critical path on normal accesses because both cache and DRAM arrays are accessed in parallel, making CIDR transparent to existing processor-memory interfaces. Compared to the conventional architecture relying on spare rows, CIDR lowers the area overhead of achieving equal failure rates over a wide range of single-bit error rates, such as 23.6× lower area overhead for a bit-error rate of 10-5 and a device failure rate of 10-3.
Keywords :
DRAM chips; cache storage; data structures; error statistics; fault diagnosis; memory architecture; Bloom filter; CIDR; DRAM arrays; I/O pads; area overhead; area-efficient DRAM resilience architecture; augmented cache; bit errors; cache data array; cache tags; cache-inspired DRAM resilience architecture; cost-sensitive main-memory DRAM devices; device failure rates; energy overhead minimization; faulty cells; permanent faults; processor-memory interfaces; single-bit error rates; testing phase; Arrays; Circuit faults; Decoding; Random access memory; Resilience; Testing; DRAM, error resilience, permanent faults, row and column sparing, Bloom filter, DRAM-side caching;
Journal_Title :
Computer Architecture Letters
DOI :
10.1109/LCA.2014.2324894