DocumentCode
9286
Title
CIDR: A Cache Inspired Area-Efficient DRAM Resilience Architecture against Permanent Faults
Author
Seongil, O. ; Sanghyuk Kwon ; Young Hoon Son ; Yujin Park ; Jung Ho Ahn
Author_Institution
Dept. of Transdisciplinary Studies, Seoul Nat. Univ., Seoul, South Korea
Volume
14
Issue
1
fYear
2015
fDate
Jan.-June 1 2015
Firstpage
17
Lastpage
20
Abstract
Faulty cells have become major problems in cost-sensitive main-memory DRAM devices. Conventional solutions to reduce device failure rates due to cells with permanent faults, such as populating spare rows and relying on errorcorrecting codes, have had limited success due to high area overheads. In this paper, we propose CIDR, a novel cache-inspired DRAM resilience architecture, which substantially reduces the area overhead of handling bit errors from these faulty cells. A DRAM device adopting CIDR has a small cache next to its I/O pads to replace accesses to the addresses that include the faulty cells with ones that correspond to the cache data array. We minimize the energy overhead of accessing the cache tags for every read or write by adding a Bloom filter in front of the cache. The augmented cache is programmed once during the testing phase and is out of the critical path on normal accesses because both cache and DRAM arrays are accessed in parallel, making CIDR transparent to existing processor-memory interfaces. Compared to the conventional architecture relying on spare rows, CIDR lowers the area overhead of achieving equal failure rates over a wide range of single-bit error rates, such as 23.6× lower area overhead for a bit-error rate of 10-5 and a device failure rate of 10-3.
Keywords
DRAM chips; cache storage; data structures; error statistics; fault diagnosis; memory architecture; Bloom filter; CIDR; DRAM arrays; I/O pads; area overhead; area-efficient DRAM resilience architecture; augmented cache; bit errors; cache data array; cache tags; cache-inspired DRAM resilience architecture; cost-sensitive main-memory DRAM devices; device failure rates; energy overhead minimization; faulty cells; permanent faults; processor-memory interfaces; single-bit error rates; testing phase; Arrays; Circuit faults; Decoding; Random access memory; Resilience; Testing; DRAM, error resilience, permanent faults, row and column sparing, Bloom filter, DRAM-side caching;
fLanguage
English
Journal_Title
Computer Architecture Letters
Publisher
ieee
ISSN
1556-6056
Type
jour
DOI
10.1109/LCA.2014.2324894
Filename
6817525
Link To Document