Title :
Architectural and compiler techniques for energy reduction in high-performance microprocessors
Author :
Bellas, Nikolaos ; Hajj, Ibrahim N. ; Polychronopoulos, Constantine D. ; Stamoulis, George
Author_Institution :
Digital/DNA Syst. Archit. Lab., Motorola Inc., Schaumburg, IL, USA
fDate :
6/1/2000 12:00:00 AM
Abstract :
In this paper, we focus on low-power design techniques for high-performance processors at the architectural and compiler levels. We focus mainly on developing methods for reducing the energy dissipated in the on-chip caches. Energy dissipated in caches represents a substantial portion in the energy budget of today\´s processors. Extrapolating current trends, this portion is likely to increase in the near future, since the devices devoted to the caches occupy an increasingly larger percentage of the total area of the chip. We propose a method that uses an additional minicache located between the I-Cache and the central processing unit (CPU) core and buffers instructions that are nested within loops and are continuously otherwise fetched from the I-Cache. This mechanism is combined with code modifications, through the compiler, that greatly simplify the required hardware, eliminate unnecessary instruction fetching, and consequently reduce signal switching activity and the dissipated energy. We show that the additional cache, dubbed L-Cache, is much smaller and simpler than the I-Cache when the compiler assumes the role of allocating instructions to it. Through simulation, we show that for the SPECfp95 benchmarks, the I-Cache remains disabled most of the time, and the "cheaper" extra cache is used instead. We also propose different techniques that are better adapted to nonnumeric nonloop-intensive code.
Keywords :
cache storage; computer architecture; instruction sets; integrated circuit design; low-power electronics; microprocessor chips; program compilers; I-Cache; L-Cache; SPECfp95 benchmarks; architectural techniques; central processing unit; code modifications; compiler techniques; dissipated energy; energy reduction; high-performance microprocessors; instruction fetching; low-power design techniques; nonnumeric nonloop-intensive code; on-chip caches; signal switching activity; Central Processing Unit; Design optimization; Energy consumption; Frequency; Hardware; Microprocessors; Power dissipation; Power system reliability; Throughput; Voltage;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on