DocumentCode :
3522881
Title :
A bandwidth-aware memory-subsystem resource management using non-invasive resource profilers for large CMP systems
Author :
Kaseridis, Dimitris ; Stuecheli, J. ; Chen, Jian ; John, Lizy K.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
fYear :
2010
fDate :
9-14 Jan. 2010
Firstpage :
1
Lastpage :
11
Abstract :
By integrating multiple cores in a single chip, Chip Multiprocessors (CMP) provide an attractive approach to improve both system throughput and efficiency. This integration allows the sharing of on-chip resources which may lead to destructive interference between the executing workloads. Memorysubsystem is an important shared resource that contributes significantly to the overall throughput and power consumption. In order to prevent destructive interference, the cache capacity and memory bandwidth requirements of the last level cache have to be controlled. While previously proposed schemes focus on resource sharing within a chip, we explore additional possibilities both inside and outside a single chip. We propose a dynamic memory-subsystem resource management scheme that considers both cache capacity and memory bandwidth contention in large multi-chip CMP systems. Our approach uses low overhead, non-invasive resource profilers that are based on Mattson´s stack distance algorithm to project each core´s resource requirements and guide our cache partitioning algorithms. Our bandwidth-aware algorithm seeks for throughput optimizations among multiple chips by migrating workloads from the most resource-overcommitted chips to the ones with more available resources. Use of bandwidth as a criterion results in an overall 18% reduction in memory bandwidth along with a 7.9% reduction in miss rate, compared to existing resource management schemes. Using a cycle-accurate full system simulator, our approach achieved an average improvement of 8.5% on throughput.
Keywords :
cache storage; multiprocessing systems; storage management chips; system-on-chip; Mattson´s stack distance algorithm; bandwidth-aware algorithm; bandwidth-aware memory-subsystem resource management; cache capacity; cache partitioning algorithms; chip multiprocessors; cycle-accurate full system simulator; destructive interference; dynamic memory-subsystem resource management scheme; last level cache; memory bandwidth contention; memory bandwidth requirements; memory subsystem; multichip CMP systems; multiple cores; non-invasive resource profilers; resource requirements; resource sharing; resource-overcommitted chips; single chip; throughput optimizations; Bandwidth; Energy consumption; Heuristic algorithms; Interference; Memory management; Partitioning algorithms; Quality of service; Resource management; Throughput; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computer Architecture (HPCA), 2010 IEEE 16th International Symposium on
Conference_Location :
Bangalore
ISSN :
1530-0897
Print_ISBN :
978-1-4244-5658-1
Type :
conf
DOI :
10.1109/HPCA.2010.5416654
Filename :
5416654
Link To Document :
بازگشت