• DocumentCode
    1663023
  • Title

    Best of both latency and throughput

  • Author

    Grochowski, Ed ; Ronen, Ronny ; Shen, John ; Wang, Hong

  • Author_Institution
    Intel Labs., Santa Clara, CA, USA
  • fYear
    2004
  • Firstpage
    236
  • Lastpage
    243
  • Abstract
    This paper describes the tradeoff between latency performance and throughput performance in a power-constrained environment. We show that the key to achieving both excellent latency performance as well as excellent throughput performance is to dynamically vary the amount of energy expended to process instructions according to the amount of parallelism available in the software. We survey four techniques for achieving variable energy per instruction: voltage/frequency scaling, asymmetric cores, variable-size cores, and speculation control. We estimate the potential range of energies obtainable by each technique and conclude that a combination of asymmetric cores and voltage/frequency scaling offers the most promising approach to design a chip-level multiprocessor that can achieve both excellent latency performance and excellent throughput performance.
  • Keywords
    microprocessor chips; multiprocessing systems; pipeline processing; asymmetric cores; chip level multiprocessor design; latency performance; pipeline processing; speculation control; throughput performance; variable energy per instruction; variable size cores; voltage-frequency scaling; Delay; Educational institutions; Frequency estimation; Microarchitecture; Microprocessors; Parallel processing; Software performance; Throughput; Trademarks; Voltage control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design: VLSI in Computers and Processors, 2004. ICCD 2004. Proceedings. IEEE International Conference on
  • ISSN
    1063-6404
  • Print_ISBN
    0-7695-2231-9
  • Type

    conf

  • DOI
    10.1109/ICCD.2004.1347928
  • Filename
    1347928