The performance impact of incomplete bypassing in processor pipelines

Author

Ahuja, Pritpal S. ; Clark, Douglas W. ; Rogers, Anne

Author_Institution

Dept. of Comput. Sci., Princeton Univ., NJ, USA

fYear

1995

fDate

29 Nov-1 Dec 1995

Firstpage

36

Lastpage

45

Abstract

Pipelined processors employ hardware bypassing to eliminate certain pipeline hazards. Bypassing is logically simple but can be costly, especially in wide-issue and deeply pipelined machines. This paper studies bypassing in detail, with an emphasis on designs in which the bypassing network is not complete. Cycle-level simulations of a model of integer and floating-point pipelines running some of the SPEC92 benchmarks show that at least half of the instructions executed used a bypassed register result from a previous instruction. Missing bypasses induce interlock stalls. The paper reports measurements of the performance impact of a number of pipeline configurations with incomplete bypassing networks. This impact ranges from a slowdown of just a few percent for a configuration with one late bypass missing to a slowdown of almost a factor of two for the integer pipe with no bypassing at all. Two types of code alterations reduce the new interlock stalls. The first is a simple code transformation, the interchange of operands in instructions that perform commutative operations, which cuts the performance loss from interlock stalls in certain configurations by about 20 to 50 percent. The second transformation is to reschedule code within basic blocks to avoid any missing bypasses. In five individual experiments with a small number of configurations and two benchmarks, this rescheduling saved 25 to 50 percent of the interlock stalls. In certain configurations both transformations can be applied.
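The effect of a missing bypass, and of interchanging the operands of a commutative instruction, can be illustrated with a toy model. The sketch below is not the paper's simulator: the single-issue in-order pipeline, the three-cycle register-file delay (`RF_DELAY`), the two operand ports named "a" and "b", and every name in the code are assumptions made for illustration only. A bypass is modelled as a (port, distance) pair, and an instruction stalls until each source operand is readable either through a bypass that exists or from the register file.

```python
from dataclasses import dataclass

# Assumption: a result becomes readable from the register file this many
# cycles after its producer issues. Shorter distances need a bypass path.
RF_DELAY = 3

# Assumption: opcodes whose two source operands may be interchanged.
COMMUTATIVE = {"add", "mul", "and", "or", "xor"}


@dataclass(frozen=True)
class Instr:
    op: str
    dest: str
    src_a: str  # read on operand port "a"
    src_b: str  # read on operand port "b"


def operand_ready(delta, port, bypasses):
    """True if a value produced `delta` cycles ago can be read on `port`."""
    return delta >= RF_DELAY or (port, delta) in bypasses


def stall_cycles(ins, cycle, last_write, bypasses):
    """Cycles `ins` must wait past `cycle` until both operands are readable."""
    wait = 0
    while not all(
        reg not in last_write
        or operand_ready(cycle + wait - last_write[reg], port, bypasses)
        for port, reg in (("a", ins.src_a), ("b", ins.src_b))
    ):
        wait += 1
    return wait


def run(program, bypasses, swap=False):
    """Return total interlock stalls; optionally interchange commutative operands."""
    last_write, cycle, stalls = {}, 0, 0
    for ins in program:
        wait = stall_cycles(ins, cycle, last_write, bypasses)
        if swap and ins.op in COMMUTATIVE:
            flipped = Instr(ins.op, ins.dest, ins.src_b, ins.src_a)
            wait = min(wait, stall_cycles(flipped, cycle, last_write, bypasses))
        stalls += wait
        cycle += wait
        last_write[ins.dest] = cycle  # issue cycle of the newest producer
        cycle += 1
    return stalls


program = [
    Instr("add", "r1", "r2", "r3"),
    Instr("add", "r4", "r5", "r1"),  # needs r1 on port "b" one cycle later
    Instr("sub", "r6", "r7", "r4"),  # not commutative, cannot be interchanged
]
full = {("a", 1), ("a", 2), ("b", 1), ("b", 2)}
partial = full - {("b", 1)}  # one bypass path removed

print(run(program, full))                # complete bypassing
print(run(program, partial))             # one bypass missing
print(run(program, partial, swap=True))  # with operand interchange
print(run(program, set()))               # no bypassing at all
```

In this toy setting the interchange helps only the commutative `add`; the `sub` still stalls, which is the kind of case that rescheduling within a basic block would have to address.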

Keywords

hazards and race conditions; parallel architectures; performance evaluation; pipeline processing; SPEC92 benchmarks; cycle-level simulations; incomplete bypassing; incomplete bypassing networks; interlock stalls; performance; pipeline hazards; processor pipelines; rescheduling; slowdown; Computer science; Hardware; Hazards; Logic; Microprocessors; Performance loss; Pipelines; Registers; VLIW; Wiring;

fLanguage

English

Publisher

IEEE

Conference_Title

Microarchitecture, 1995., Proceedings of the 28th Annual International Symposium on

Conference_Location

Ann Arbor, MI

ISSN

1072-4451

Print_ISBN

0-8186-7349-4

Type

conf

DOI

10.1109/MICRO.1995.476811

Filename

476811