DocumentCode :
704734
Title :
Micro-architecture independent analytical processor performance and power modeling
Author :
Van den Steen, Sam ; De Pestel, Sander ; Mechri, Moncef ; Eyerman, Stijn ; Carlson, Trevor ; Black-Schaffer, David ; Hagersten, Erik ; Eeckhout, Lieven
Author_Institution :
Dept. of Electron. & Inf. Syst., Ghent Univ., Ghent, Belgium
fYear :
2015
fDate :
29-31 March 2015
Firstpage :
32
Lastpage :
41
Abstract :
Optimizing processors for specific application(s) can substantially improve energy-efficiency. With the end of Dennard scaling, and the corresponding reduction in energyefficiency gains from technology scaling, such approaches may become increasingly important. However, designing applicationspecific processors require fast design space exploration tools to optimize for the targeted application(s). Analytical models can be a good fit for such design space exploration as they provide fast performance estimations and insight into the interaction between an application´s characteristics and the micro-architecture of a processor. Unfortunately, current analytical models require some microarchitecture dependent inputs, such as cache miss rates, branch miss rates and memory-level parallelism. This requires profiling the applications for each cache and branch predictor configuration, which is far more time-consuming than evaluating the actual performance models. In this work we present a micro-architecture independent profiler and associated analytical models that allow us to produce performance and power estimates across a large design space almost instantaneously. We show that using a micro-architecture independent profile leads to a speedup of 25× for our evaluated design space, compared to an analytical model that uses micro-architecture dependent profiles. Over a large design space, the model has a 13% error for performance and a 7% error for power, compared to cycle-level simulation. The model is able to accurately determine the optimal processor configuration for different applications under power or performance constraints, and it can provide insight into performance through cycle stacks.
Keywords :
cache storage; computer architecture; microprocessor chips; power aware computing; branch miss rates; branch predictor configuration; cache miss rates; cache predictor configuration; cycle stacks; cycle-level simulation; energy-efficiency gains; fast design space exploration tools; memory-level parallelism; microarchitecture dependent inputs; microarchitecture independent analytical processor performance; microarchitecture independent profiler; performance constraints; power estimates; power modeling; processor microarchitecture; processor optimization; technology scaling; Analytical models; Computational modeling; Entropy; Estimation; Mathematical model; Parallel processing; Predictive models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2015 IEEE International Symposium on
Conference_Location :
Philadelphia, PA
Type :
conf
DOI :
10.1109/ISPASS.2015.7095782
Filename :
7095782
Link To Document :
بازگشت