Title :
RDIS: A recursively defined invertible set scheme to tolerate multiple stuck-at faults in resistive memory
Author :
Melhem, Rami ; Maddah, Rakan ; Cho, Sangyeun
Author_Institution :
Comput. Sci. Dept., Univ. of Pittsburgh, Pittsburgh, PA, USA
Abstract :
With their potential for high scalability and density, resistive memories are foreseen as a promising technology that overcomes the physical limitations confronted by charge-based DRAM and flash memory. Yet, a main burden towards the successful adoption and commercialization of resistive memories is their low cell reliability caused by process variation and limited write endurance. Typically, faulty and worn-out cells are permanently stuck at either `0´ or `1´. To overcome the challenge, a robust error correction scheme that can recover from many hard faults is required. In this paper, we propose and evaluate RDIS, a novel scheme to efficiently tolerate memory stuck-at faults. RDIS allows for the correct retrieval of data by recursively determining and efficiently keeping track of the positions of the bits that are stuck at a value different from the ones that are written, and then, at read time, by inverting the values read from those positions. RDIS is characterized by a very low probability of failure that increases slowly with the relative increase in the number of faults. Moreover, RDIS tolerates many more faults than the best existing scheme-by up to 95% on average at the same overhead level.
Keywords :
DRAM chips; error correction codes; fault diagnosis; fault tolerance; flash memories; integrated circuit reliability; integrated circuit testing; phase change memories; probability; set theory; RDIS; cell reliability; charge-based DRAM; data retrieval; failure probability; faulty worn-out cells; flash memory; multiple stuck-at fault tolerance; process variation; read time; recursively defined invertible set scheme; resistive memory; robust error correction scheme; write endurance; Computer architecture; Error correction; Error correction codes; Hardware; Noise measurement; Radiation detectors; Random access memory; Error Correction Code; Fault Tolerance; Hard Faults; Phase Change Memory; Reliability;
Conference_Titel :
Dependable Systems and Networks (DSN), 2012 42nd Annual IEEE/IFIP International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-1624-8
Electronic_ISBN :
1530-0889
DOI :
10.1109/DSN.2012.6263949