• DocumentCode
    1621260
  • Title

    Detailed design and evaluation of redundant multi-threading alternatives

  • Author

    Mukherjee, Shubhendu S. ; Kontz, Michael ; Reinhardt, Steven K.

  • Author_Institution
    Compaq Comput. Corp., Houston, TX, USA
  • fYear
    2002
  • fDate
    6/24/1905 12:00:00 AM
  • Firstpage
    99
  • Lastpage
    110
  • Abstract
    Exponential growth in the number of on-chip transistors, coupled. with reductions in. voltage levels, makes each generation of microprocessors increasingly vulnerable to transient faults. In a multi-threaded environment, we can detect these faults by running two copies of the same program as separate threads, feeding them identical inputs, and comparing their outputs, a technique we call redundant multi-threading (RMT). This paper studies RMT techniques in the context of both single- and dual-processor simultaneous multi-threaded (SMT) single-chip devices. Using a detailed, commercial-grade, SMT processor design we uncover subtle RMT implementation complexities, and find that RMT can be a more significant burden for single-processor devices than prior studies indicate. However, a novel application of RMT techniques in a dual-processor device, which we term chip-level redundant threading, shows higher performance than lock-stepping the two cores, especially on multi-threaded workloads
  • Keywords
    fault diagnosis; fault tolerant computing; multi-threading; parallel architectures; SRT processor; dual-processor device; fault-detection; microprocessors; redundant multithreading; simultaneous redundantly threaded processor; transient fault diagnosis; Degradation; Fault detection; Hardware; Microprocessors; Multithreading; Process design; Surface-mount technology; Transistors; Voltage; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 2002. Proceedings. 29th Annual International Symposium on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6897
  • Print_ISBN
    0-7695-1605-X
  • Type

    conf

  • DOI
    10.1109/ISCA.2002.1003566
  • Filename
    1003566