• DocumentCode
    3130192
  • Title

    A complexity-effective approach to ALU bandwidth enhancement for instruction-level temporal redundancy

  • Author

    Parashar, Angshuman ; Gurumurthi, Sudhanva ; Sivasubramaniam, Anand

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2004
  • fDate
    19-23 June 2004
  • Firstpage
    376
  • Lastpage
    386
  • Abstract
    Previous proposals for implementing instruction-level temporal redundancy in out-of-order cores have reported a performance degradation of up to 45% in certain applications compared to an execution which does not have any temporal redundancy. An important contributor to this problem is the insufficient number of ALUs for handling the amplified load injected into the core. At the same time, increasing the number of ALUs can increase the complexity of the issue logic, which has been pointed out to be one of the most timing critical components of the processor. This paper proposes a novel extension of a prior idea on instruction reuse to ease ALU bandwidth requirements in a complexity-effective way by exploiting certain interesting properties of a dual (temporally redundant) instruction stream. We present microarchitectural extensions necessary for implementing an instruction reuse buffer (IRB) and integrating this with the issue logic of a dual instruction stream superscalar core, and conduct extensive evaluations to demonstrate how well it can alleviate the ALU bandwidth problem. We show that on the average we can gain back nearly 50% of the IPC loss that occurred due to ALU bandwidth limitations for an instruction-level temporally redundant superscalar execution, and 23% of the overall IPC loss.
  • Keywords
    buffer storage; instruction sets; logic design; microprocessor chips; multiprocessing systems; parallel architectures; redundancy; ALU bandwidth enhancement; ALU bandwidth requirements; IPC loss; complexity-effective design; dual instruction stream superscalar core; instruction reuse buffer; instruction-level temporal redundancy; load handling; microarchitectural extensions; processor components; temporally redundant instruction stream; Application software; Bandwidth; Computer science; Degradation; Hardware; Logic; Out of order; Proposals; Redundancy; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
  • ISSN
    1063-6897
  • Print_ISBN
    0-7695-2143-6
  • Type

    conf

  • DOI
    10.1109/ISCA.2004.1310789
  • Filename
    1310789