• DocumentCode
    3207349
  • Title

    Speculative instruction validation for performance-reliability trade-off

  • Author

    Kumar, Sumeet ; Aggarwal, Aneesh

  • Author_Institution
    Electr. & Comput. Eng., Binghamton Univ., Binghamton, NY
  • fYear
    2008
  • fDate
    16-20 Feb. 2008
  • Firstpage
    405
  • Lastpage
    414
  • Abstract
    With reducing feature size, increasing chip capacity, and increasing clock speed, microprocessors are becoming increasingly susceptible to transient (soft) errors. Redundant multi-threading (RMT) is an attractive approach for concurrent error detection. RMT provides complete error coverage, while incurring a significant performance impact because of the redundant thread. Achieving perfect reliability at the expense of a high performance drop is not a good design option for systems where slight vulnerability may still achieve the desired error rates. In this paper, we explore speculative mechanisms to trade-off reliability for performance in RMT. Our basic approach validates the execution of an instruction by comparing its result against the expected result. Only those instructions are redundantly executed for which the validations fail. This mechanism is expected to have a minimal vulnerability impact because it is highly unlikely that an erroneous result matches the expected value. We also propose several extensions to the basic approach that further explore the performance-reliability trade-off design space. A combination of these techniques incur about 10% performance impact and about 0.09% undetected base error rate, compared to about 25% performance impact for RMT with no undetected errors.
  • Keywords
    formal verification; microcomputers; multi-threading; concurrent error detection; microprocessors; performance-reliability trade-off; redundant multithreading; speculative instruction validation; Clocks; Computer aided instruction; Computer errors; Error analysis; Frequency; Microprocessors; Proposals; Redundancy; Voltage; Yarn; Concurrent Error Detection; Instruction Validation; Performance-Reliability Trade-off; Reducing Instruction Redundancy; Redundant Multi-threading;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computer Architecture, 2008. HPCA 2008. IEEE 14th International Symposium on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1530-0897
  • Print_ISBN
    978-1-4244-2070-4
  • Type

    conf

  • DOI
    10.1109/HPCA.2008.4658656
  • Filename
    4658656