Title :
Practical data value speculation for future high-end processors
Author :
Perais, Arthur ; Seznec, Andre
Author_Institution :
IRISA, INRIA, Rennes, France
Abstract :
Dedicating more silicon area to single thread performance will necessarily be considered as worthwhile in future - potentially heterogeneous - multicores. In particular, Value prediction (VP) was proposed in the mid 90´s to enhance the performance of high-end uniprocessors by breaking true data dependencies. In this paper, we reconsider the concept of Value Prediction in the contemporary context and show its potential as a direction to improve current single thread performance. First, building on top of research carried out during the previous decade on confidence estimation, we show that every value predictor is amenable to very high prediction accuracy using very simple hardware. This clears the path to an implementation of VP without a complex selective reissue mechanism to absorb mispredictions. Prediction is performed in the in-order pipeline frond-end and validation is performed in the in-order pipeline back-end, while the out-of-order engine is only marginally modified. Second, when predicting back-to-back occurrences of the same instruction, previous context-based value predictors relying on local value history exhibit a complex critical loop that should ideally be implemented in a single cycle. To bypass this requirement, we introduce a new value predictor VTAGE harnessing the global branch history. VTAGE can seamlessly predict back-to-back occurrences, allowing predictions to span over several cycles. It achieves higher performance than previously proposed context-based predictors. Specifically, using SPEC´00 and SPEC´06 benchmarks, our simulations show that combining VTAGE and a stride based predictor yields up to 65% speedup on a fairly aggressive pipeline without support for selective reissue.
Keywords :
microprocessor chips; multiprocessing systems; pipeline processing; SPEC´00 benchmarks; SPEC´06 benchmarks; VP; VTAGE; aggressive pipeline; back-to-back occurrence prediction; complex critical loop; confidence estimation; context-based value predictors; data value speculation; global branch history; heterogeneous multicores; high-end uniprocessors; in-order pipeline back-end; in-order pipeline frond-end; out-of-order engine; silicon area; single thread performance; true data dependencies; Accuracy; Engines; Hardware; History; Pipelines; Radiation detectors; Registers;
Conference_Titel :
High Performance Computer Architecture (HPCA), 2014 IEEE 20th International Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/HPCA.2014.6835952