DocumentCode
3130192
Title
A complexity-effective approach to ALU bandwidth enhancement for instruction-level temporal redundancy
Author
Parashar, Angshuman ; Gurumurthi, Sudhanva ; Sivasubramaniam, Anand
Author_Institution
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear
2004
fDate
19-23 June 2004
Firstpage
376
Lastpage
386
Abstract
Previous proposals for implementing instruction-level temporal redundancy in out-of-order cores have reported a performance degradation of up to 45% in certain applications compared to an execution which does not have any temporal redundancy. An important contributor to this problem is the insufficient number of ALUs for handling the amplified load injected into the core. At the same time, increasing the number of ALUs can increase the complexity of the issue logic, which has been pointed out to be one of the most timing critical components of the processor. This paper proposes a novel extension of a prior idea on instruction reuse to ease ALU bandwidth requirements in a complexity-effective way by exploiting certain interesting properties of a dual (temporally redundant) instruction stream. We present microarchitectural extensions necessary for implementing an instruction reuse buffer (IRB) and integrating this with the issue logic of a dual instruction stream superscalar core, and conduct extensive evaluations to demonstrate how well it can alleviate the ALU bandwidth problem. We show that on the average we can gain back nearly 50% of the IPC loss that occurred due to ALU bandwidth limitations for an instruction-level temporally redundant superscalar execution, and 23% of the overall IPC loss.
Keywords
buffer storage; instruction sets; logic design; microprocessor chips; multiprocessing systems; parallel architectures; redundancy; ALU bandwidth enhancement; ALU bandwidth requirements; IPC loss; complexity-effective design; dual instruction stream superscalar core; instruction reuse buffer; instruction-level temporal redundancy; load handling; microarchitectural extensions; processor components; temporally redundant instruction stream; Application software; Bandwidth; Computer science; Degradation; Hardware; Logic; Out of order; Proposals; Redundancy; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
ISSN
1063-6897
Print_ISBN
0-7695-2143-6
Type
conf
DOI
10.1109/ISCA.2004.1310789
Filename
1310789
Link To Document