Title :
Exploiting microarchitectural redundancy for defect tolerance
Author :
Shivakumar, Premkishore ; Keckler, Stephen W. ; Moore, Charles R. ; Burger, Doug
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
fDate :
Sept. 30 2012-Oct. 3 2012
Abstract :
The continued increase in microprocessor clock frequency that has come from advancements in fabrication technology and reductions in feature size, creates challenges in maintaining both manufacturing yield rates and long-term reliability of devices. Methods based on defect detection and reduction may not offer a scalable solution due to cost of eliminating contaminants in the manufacturing process and increasing chip complexity. This paper proposes to use the inherent redundancy available in existing and future chip microarchitectures to improve yield and enable graceful performance degradation in fail-in-place systems. We introduce a new yield metric called performance averaged yield (Ypav) which accounts both for fully functional chips and those that exhibit some performance degradation. Our results indicate that at 250nm we are able to increase the Ypav of a uniprocessor with only redundant rows in its caches from a base value of 85% to 98% using microarchitectural redundancy. Given constant chip area, shrinking feature sizes increases fault susceptibility and reduces the base Ypav to 60% at 50nm, which exploiting microarchitectural redundancy then increases to 99.6%.
Keywords :
computer architecture; fault diagnosis; integrated circuit design; integrated circuit reliability; integrated circuit yield; microprocessor chips; redundancy; chip area; chip complexity; chip microarchitecture; defect detection; defect reduction; defect tolerance; fabrication technology; fail-in-place system; fault susceptibility; feature size reduction; feature size shrinking; functional chips; long-term device reliability; manufacturing process; manufacturing yield rate; microarchitectural redundancy; microprocessor clock frequency; performance averaged yield; performance degradation; uniprocessor; yield metric; Benchmark testing; Computational modeling; Computers; Microarchitecture; Program processors; Redundancy; System-on-a-chip;
Conference_Titel :
Computer Design (ICCD), 2012 IEEE 30th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-3051-0
DOI :
10.1109/ICCD.2012.6378613