DocumentCode
742369
Title
Exploiting Replicated Cache Blocks to Reduce L2 Cache Leakage in CMPs
Author
Hyunhee Kim ; Jung Ho Ahn ; Jihong Kim
Author_Institution
Samsung Electron. Co. Ltd., Suwon, South Korea
Volume
21
Issue
10
fYear
2013
Firstpage
1863
Lastpage
1877
Abstract
Modern chip multiprocessors (CMPs) employ large L2 caches to reduce the performance gap between processors and off-chip memory. However, as the size of an L2 cache increases, its leakage power consumption also becomes a major contributor to the total power dissipation. Managing the leakage power of L2 caches, therefore, is an important issue in realizing low-power CMPs. In CMPs with private L2 caches, each processor makes a copy of the data in its local cache in order to access the data faster, which is called replication. In this paper, we propose a novel leakage management technique that dynamically turns off replications in private L2 caches for leakage power reduction by exploiting two key observations: 1) the cost of an extra cache miss due to the turned-off replication is small because the same cache block exists in another on-chip cache and 2) turning off the replication incurs no extra cache miss if it is invalidated by other processors in order to maintain cache coherence. Since blindly turning off the frequently accessed replications can degrade performance, the proposed technique dynamically controls the number of turned-off replications. The proposed technique can be implemented by slightly modifying the MESI protocol with a new turned-off shared (TOS) coherence state. The TOS state indicates that the corresponding block is shared by other caches but turned off. Experiments on a four-processor CMP with private L2 caches show that the proposed technique reduces the energy consumption of the L2 caches and the main memory by 19.4% on average, with less than 1% performance loss over the existing cache leakage management technique.
Keywords
cache storage; leakage currents; low-power electronics; microprocessor chips; L2 cache leakage; MESI protocol; TOS coherence state; cache coherence; cache leakage management technique; chip multiprocessors; data faster; extra cache miss; four-processor CMP; leakage power consumption; leakage power reduction; low-power CMP; off-chip memory; on-chip cache; performance loss; power dissipation; private L2 caches; replicated cache blocks; turned-off replications; turned-off shared coherence state; Coherence; Degradation; Energy consumption; Power demand; Program processors; Protocols; System-on-a-chip; Cache coherence; chip multiprocessors (CMPs); leakage power management; private L2 caches; replication;
fLanguage
English
Journal_Title
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-8210
Type
jour
DOI
10.1109/TVLSI.2012.2220791
Filename
6352931
Link To Document