DocumentCode :
1335322
Title :
Codesign of NoC and Cache Organization for Reducing Access Latency in Chip Multiprocessors
Author :
Abousamra, A. ; Jones, A.K. ; Melhem, R.
Author_Institution :
Dept. of Comput. Sci., Univ. of Pittsburgh, Pittsburgh, PA, USA
Volume :
23
Issue :
6
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
1038
Lastpage :
1046
Abstract :
Reducing data access latency is vital to achieving performance improvements in computing. For chip multiprocessors (CMPs), data access latency depends on the organization of the memory hierarchy, the on-chip interconnect, and the running workload. Several network-on-chip (NoC) designs exploit communication locality to reduce communication latency by configuring special fast paths or circuits on which communication is faster than the rest of the NoC. However, communication patterns are directly affected by the cache organization and many cache organizations are designed in isolation of the underlying NoC or assume a simple NoC design, thus possibly missing optimization opportunities. In this work, we take a codesign approach of the NoC and cache organization. First, we propose a hybrid circuit/packet-switched NoC that exploits communication locality through periodic configuration of the most beneficial circuits. Second, we design a Unique Private (UP) caching scheme targeting the class of interconnects which exploit communication locality to improve communication latency. The Unique Private cache stores the data that are mostly accessed by each processor core in the core´s locally accessible cache bank, while leveraging dedicated high-speed circuits in the interconnect to provide remote cores with fast access to shared data. Simulations of a suite of scientific and commercial workloads show that our proposed design achieves a speedup of 15.2 and 14 percent on a 16-core and a 64-core CMP, respectively, over the state-of-the-art NoC-Cache codesigned system that also exploits communication locality in multithreaded applications.
Keywords :
cache storage; multiprocessing systems; network-on-chip; CMP; NoC design; cache organization; chip multiprocessors; data access latency; hybrid circuit/packet-switched NoC; memory hierarchy; multithreaded applications; network-on-chip designs; on-chip interconnect; unique private caching scheme; Cache memory; Distributed databases; Integrated circuit interconnections; Organizations; Routing; Switching circuits; System-on-a-chip; Multicore/single-chip multiprocessors; cache memories.; circuit-switching networks;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2011.238
Filename :
6030871
Link To Document :
بازگشت