DocumentCode :
2947107
Title :
MorphCache: A Reconfigurable Adaptive Multi-level Cache hierarchy
Author :
Srikantaiah, Shekhar ; Kultursay, Emre ; Zhang, Tao ; Kandemir, Mahmut ; Irwin, Mary Jane ; Xie, Yuan
Author_Institution :
Pennsylvania State Univ., University Park, PA, USA
fYear :
2011
fDate :
12-16 Feb. 2011
Firstpage :
231
Lastpage :
242
Abstract :
Given the diverse range of application characteristics that chip multiprocessors (CMPs) need to cater to, a “one-cache-topology-fits-all” design philosophy will clearly be inadequate. In this paper, we propose MorphCache, a Reconfigurable Adaptive Multi-level Cache hierarchy. Mor-phCache dynamically tunes a multi-level cache topology in a CMP to allow significantly different cache topologies to exist on the same architecture. Starting from per-core L2 and L3 cache slices as the basic design point, MorphCache alters the cache topology dynamically by merging or splitting cache slices and modifying the accessibility of different cache slice groups to different cores in a CMP. We evaluated MorphCache on a 16 core CMP on a full system simulator and found that it significantly improves both average throughput and harmonic mean of speedups of diverse multithreaded and multiprogrammed workloads. Specifically, our results show that MorphCache improves throughput of the multiprogrammed mixes by 29.9% over a topology with all-shared L2 and L3 caches and 27.9% over a topology with per core private L2 cache and shared L3 cache. In addition, we also compared MorphCache to partitioning a single shared cache at each level using promotion/insertion pseudo-partitioning (PIPP) [28] and managing per-core private cache at each level using dynamic spill receive caches (DSR) [18]. We found that MorphCache improves average throughput by 6.6% over PIPP and by 5.7% over DSR when applied to both L2 and L3 caches.
Keywords :
cache storage; microprocessor chips; multiprocessing systems; CMP; chip multiprocessors; diverse multithreaded; morphcache; multiprogrammed workloads; promotion/insertion pseudo-partitioning; reconfigurable adaptive multi-level cache hierarchy; Benchmark testing; Estimation; Instruction sets; Merging; System-on-a-chip; Throughput; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computer Architecture (HPCA), 2011 IEEE 17th International Symposium on
Conference_Location :
San Antonio, TX
ISSN :
1530-0897
Print_ISBN :
978-1-4244-9432-3
Type :
conf
DOI :
10.1109/HPCA.2011.5749732
Filename :
5749732
Link To Document :
بازگشت