Title :
The impact of delay on the design of branch predictors
Author :
Jiménez, Daniel A. ; Keckler, Stephen W. ; Lin, Calvin
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
Abstract :
Modern microprocessors employ increasingly complicated branch predictors to achieve instruction fetch bandwidth that is sufficient for wide out-of-order execution cores. While existing predictors can still be accessed in a single clock cycle, recent studies show that slower wires and faster clock rates will require multi-cycle access times to large on-chip structures, such as branch prediction tables. Thus, future branch predictors must consider not only area and accuracy, but also delay. The paper explores these tradeoffs in designing branch predictors and shows that increased accuracy alone cannot overcome the penalties in delay that arise with larger predictor structures. We evaluate three schemes for accommodating delay: a caching approach, an overriding approach, and a cascading lookahead approach. While we use a common branch predictor, gshare, as the prediction component, these schemes can be constructed using most types of predictors
Keywords :
cache storage; microprocessor chips; parallel architectures; pipeline processing; branch prediction tables; branch predictor design; caching approach; cascading lookahead approach; clock rates; common branch predictor; complicated branch predictors; delay; future branch predictors; gshare; instruction fetch bandwidth; large on-chip structures; modern microprocessors; multi-cycle access times; overriding approach; prediction component; predictor structures; single clock cycle; wide out-of-order execution cores; Bandwidth; Clocks; Computer aided instruction; Delay; Frequency; Hardware; History; Microprocessors; Out of order; Wires;
Conference_Titel :
Microarchitecture, 2000. MICRO-33. Proceedings. 33rd Annual IEEE/ACM International Symposium on
Conference_Location :
Monterey, CA
Print_ISBN :
0-7695-0924-X
DOI :
10.1109/MICRO.2000.898059