Title :
Best of both latency and throughput
Author :
Grochowski, Ed ; Ronen, Ronny ; Shen, John ; Wang, Hong
Author_Institution :
Intel Labs., Santa Clara, CA, USA
Abstract :
This paper describes the tradeoff between latency performance and throughput performance in a power-constrained environment. We show that the key to achieving both excellent latency performance as well as excellent throughput performance is to dynamically vary the amount of energy expended to process instructions according to the amount of parallelism available in the software. We survey four techniques for achieving variable energy per instruction: voltage/frequency scaling, asymmetric cores, variable-size cores, and speculation control. We estimate the potential range of energies obtainable by each technique and conclude that a combination of asymmetric cores and voltage/frequency scaling offers the most promising approach to design a chip-level multiprocessor that can achieve both excellent latency performance and excellent throughput performance.
Keywords :
microprocessor chips; multiprocessing systems; pipeline processing; asymmetric cores; chip level multiprocessor design; latency performance; pipeline processing; speculation control; throughput performance; variable energy per instruction; variable size cores; voltage-frequency scaling; Delay; Educational institutions; Frequency estimation; Microarchitecture; Microprocessors; Parallel processing; Software performance; Throughput; Trademarks; Voltage control;
Conference_Titel :
Computer Design: VLSI in Computers and Processors, 2004. ICCD 2004. Proceedings. IEEE International Conference on
Print_ISBN :
0-7695-2231-9
DOI :
10.1109/ICCD.2004.1347928