Title :
Meeting midway: improving CMP performance with memory-side prefetching
Author :
Beckmann, Nathan ; Sanchez, Daniel
Author_Institution :
Massachusetts Inst. of Technol., Cambridge, MA, USA
Abstract :
Shared last-level caches, widely used in chip-multi-processors (CMPs), face two fundamental limitations. First, the latency and energy of shared caches degrade as the system scales up. Second, when multiple workloads share the CMP, they suffer from interference in shared cache accesses. Unfortunately, prior research addressing one issue either ignores or worsens the other: NUCA techniques reduce access latency but are prone to hotspots and interference, and cache partitioning techniques only provide isolation but do not reduce access latency.
Keywords :
cache storage; multiprocessor interconnection networks; shared memory systems; CMP; Jigsaw; NUCA techniques; access latency reduction; cache partitioning techniques; chip-multiprocessors; multiple workloads; scalable software-defined caches; shared cache accesses; shared last-level caches; Coherence; Hardware; Interference; Monitoring; Quality of service; Runtime; Software; NOC; memory; prefetching;
Conference_Titel :
Parallel Architectures and Compilation Techniques (PACT), 2013 22nd International Conference on
Conference_Location :
Edinburgh
Print_ISBN :
978-1-4799-1018-2
DOI :
10.1109/PACT.2013.6618818