Title :
Embedded way prediction for last-level caches
Author :
Sleiman, Faissal M. ; Dreslinski, Ronald G. ; Wenisch, Thomas F.
Author_Institution :
Electr. Eng. & Comput. Sci., Univ. of Michigan, Ann Arbor, MI, USA
fDate :
Sept. 30 2012-Oct. 3 2012
Abstract :
This paper investigates Embedded Way Prediction for large last-level caches (LLCs): an architecture and circuit design to provide the latency of parallel tag-data access at substantial energy savings. Existing way prediction approaches for L1 caches are compromised by the high associativity and filtered temporal locality of LLCs. We demonstrate: (1) the need for wide partial tag comparison, which we implement with a dynamic CAM alongside the data sub-array wordline decode, and (2) the inhibit bit, an architectural innovation to provide accurate predictions when the partial tag comparison is inconclusive. We present circuit critical-path and architectural power/performance studies demonstrating speedups of up to 15.4% (6.6% average) for scientific and server applications, matching the performance of parallel tag-data access while reducing energy overhead by 40%.
Keywords :
cache storage; embedded systems; L1 caches; LLC; architectural innovation; circuit critical-path; circuit design; data subarray wordline decode; dynamic CAM; embedded way prediction; energy overhead reduction; last-level caches; parallel tag-data access; partial tag comparison; temporal locality; wide partial tag comparison; Arrays; Computer aided manufacturing; Decoding; Delay; Random access memory; Servers; Tiles;
Conference_Titel :
Computer Design (ICCD), 2012 IEEE 30th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-3051-0
DOI :
10.1109/ICCD.2012.6378636