• DocumentCode
    2977694
  • Title

    Reunion: Complexity-Effective Multicore Redundancy

  • Author

    Smolens, Jared C. ; Gold, Brian T. ; Falsafi, Babak ; Hoe, James C.

  • Author_Institution
    Comput. Archit. Lab., Carnegie Mellon Univ., Pittsburgh, PA
  • fYear
    2006
  • fDate
    Dec. 2006
  • Firstpage
    223
  • Lastpage
    234
  • Abstract
    To protect processor logic from soft errors, multicore redundant architectures execute two copies of a program on separate cores of a chip multiprocessor (CMP). Maintaining identical instruction streams is challenging because redundant cores operate independently, yet must still receive the same inputs (e.g., load values and shared-memory invalidations). Past proposals strictly replicate load values across two cores, requiring significant changes to the highly-optimized core. We make the key observation that, in the common case, both cores load identical values without special hardware. When the cores do receive different load values (e.g., due to a data race), the same mechanisms employed for soft error detection and recovery can correct the difference. This observation permits designs that relax input replication, while still providing correct redundant execution. In this paper, we present Reunion, an execution model that provides relaxed input replication and preserves the existing memory interface, coherence protocols, and consistency models. We evaluate a CMP-based implementation of the Reunion execution model with full-system, cycle-accurate simulation. We show that the performance overhead of relaxed input replication is only 5% and 6% for commercial and scientific workloads, respectively
  • Keywords
    cache storage; computer architecture; instruction sets; microprocessor chips; chip multiprocessor; coherence protocol; instruction stream; memory consistency model; memory interface; multicore redundant architecture; relaxed input replication; soft error detection; Coherence; Computer architecture; Error correction; Gold; Laboratories; Microarchitecture; Multicore processing; Protection; Protocols; Redundancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Microarchitecture, 2006. MICRO-39. 39th Annual IEEE/ACM International Symposium on
  • Conference_Location
    Orlando, FL
  • ISSN
    1072-4451
  • Print_ISBN
    0-7695-2732-9
  • Type

    conf

  • DOI
    10.1109/MICRO.2006.42
  • Filename
    4041849