DocumentCode :
1553624
Title :
Error Tolerance in Server Class Processors
Author :
Rivers, Jude A. ; Gupta, Meeta S. ; Shin, Jeonghee ; Kudva, Prabhakar N. ; Bose, Pradip
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
30
Issue :
7
fYear :
2011
fDate :
7/1/2011 12:00:00 AM
Firstpage :
945
Lastpage :
959
Abstract :
This paper provides: 1) a very brief motivation and technological trend data to show why hard and soft errors are expected to be of increasing concern in the future; 2) a summary review of chip-level error tolerance practices today-with a brief reference to IBM´s POWER6 and POWER7 designs; 3) open research challenges and current solution approaches of promise, based on published literature; and 4) concluding remarks.
Keywords :
error statistics; fault tolerance; integrated circuit design; integrated circuit reliability; microprocessor chips; IBM; POWER6 design; POWER7 design; chip-level error tolerance practices; current solution approaches; hard errors; open research challenges; published literature; server class processors; soft errors; Error correction codes; Microprocessors; Pipelines; Program processors; Redundancy; Registers; Checkpointing; error recovery; lifetime reliability; soft errors; variability;
fLanguage :
English
Journal_Title :
Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
0278-0070
Type :
jour
DOI :
10.1109/TCAD.2011.2158100
Filename :
5875991
Link To Document :
بازگشت