DocumentCode
1663023
Title
Best of both latency and throughput
Author
Grochowski, Ed ; Ronen, Ronny ; Shen, John ; Wang, Hong
Author_Institution
Intel Labs., Santa Clara, CA, USA
fYear
2004
Firstpage
236
Lastpage
243
Abstract
This paper describes the tradeoff between latency performance and throughput performance in a power-constrained environment. We show that the key to achieving both excellent latency performance as well as excellent throughput performance is to dynamically vary the amount of energy expended to process instructions according to the amount of parallelism available in the software. We survey four techniques for achieving variable energy per instruction: voltage/frequency scaling, asymmetric cores, variable-size cores, and speculation control. We estimate the potential range of energies obtainable by each technique and conclude that a combination of asymmetric cores and voltage/frequency scaling offers the most promising approach to design a chip-level multiprocessor that can achieve both excellent latency performance and excellent throughput performance.
Keywords
microprocessor chips; multiprocessing systems; pipeline processing; asymmetric cores; chip level multiprocessor design; latency performance; pipeline processing; speculation control; throughput performance; variable energy per instruction; variable size cores; voltage-frequency scaling; Delay; Educational institutions; Frequency estimation; Microarchitecture; Microprocessors; Parallel processing; Software performance; Throughput; Trademarks; Voltage control;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Design: VLSI in Computers and Processors, 2004. ICCD 2004. Proceedings. IEEE International Conference on
ISSN
1063-6404
Print_ISBN
0-7695-2231-9
Type
conf
DOI
10.1109/ICCD.2004.1347928
Filename
1347928
Link To Document