DocumentCode
3207349
Title
Speculative instruction validation for performance-reliability trade-off
Author
Kumar, Sumeet ; Aggarwal, Aneesh
Author_Institution
Electr. & Comput. Eng., Binghamton Univ., Binghamton, NY
fYear
2008
fDate
16-20 Feb. 2008
Firstpage
405
Lastpage
414
Abstract
With reducing feature size, increasing chip capacity, and increasing clock speed, microprocessors are becoming increasingly susceptible to transient (soft) errors. Redundant multi-threading (RMT) is an attractive approach for concurrent error detection. RMT provides complete error coverage, while incurring a significant performance impact because of the redundant thread. Achieving perfect reliability at the expense of a high performance drop is not a good design option for systems where slight vulnerability may still achieve the desired error rates. In this paper, we explore speculative mechanisms to trade-off reliability for performance in RMT. Our basic approach validates the execution of an instruction by comparing its result against the expected result. Only those instructions are redundantly executed for which the validations fail. This mechanism is expected to have a minimal vulnerability impact because it is highly unlikely that an erroneous result matches the expected value. We also propose several extensions to the basic approach that further explore the performance-reliability trade-off design space. A combination of these techniques incur about 10% performance impact and about 0.09% undetected base error rate, compared to about 25% performance impact for RMT with no undetected errors.
Keywords
formal verification; microcomputers; multi-threading; concurrent error detection; microprocessors; performance-reliability trade-off; redundant multithreading; speculative instruction validation; Clocks; Computer aided instruction; Computer errors; Error analysis; Frequency; Microprocessors; Proposals; Redundancy; Voltage; Yarn; Concurrent Error Detection; Instruction Validation; Performance-Reliability Trade-off; Reducing Instruction Redundancy; Redundant Multi-threading;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computer Architecture, 2008. HPCA 2008. IEEE 14th International Symposium on
Conference_Location
Salt Lake City, UT
ISSN
1530-0897
Print_ISBN
978-1-4244-2070-4
Type
conf
DOI
10.1109/HPCA.2008.4658656
Filename
4658656
Link To Document