DocumentCode :
44328
Title :
Smart: Single-Cycle Multihop Traversals over a Shared Network on Chip
Author :
Krishna, Tushar ; Chen, Chia-Hsin Owen ; Woo-Cheol Kwon ; Li-Shiuan Peh
Author_Institution :
Massachusetts Inst. of Technol., Cambridge, MA, USA
Volume :
34
Issue :
3
fYear :
2014
fDate :
May-June 2014
Firstpage :
43
Lastpage :
56
Abstract :
As the number of on-chip cores increases, scalable on-chip topologies such as meshes inevitably add multiple hops to each network traversal. The best practice today is to design one-cycle routers, such that the low-load network latency between a source and destination is equal to the number of routers and links (that is, twice the hops) between them. Designers of operating systems, compilers, and cache coherence protocols often try to limit communication to within a few hops because on-chip latency is critical for their scalability. In this article, the authors propose an on-chip network called Smart (Single-cycle Multihop Asynchronous Repeated Traversal) that aims to present a single-cycle datapath all the way from the source to the destination. They do not add any additional fast physical express links in the datapath; instead, they drive the shared crossbars and links asynchronously up to multiple hops within a single cycle. They designed a router and link microarchitecture to achieve such a traversal, and a flow-control technique to arbitrate and set up multihop paths within a cycle. A place-and-route design at 45 nm achieves 11 hops within a 1-GHz cycle for paths without turns (9 hops for paths with turns). The authors observe 5 to 8 times reduction in low-load latencies across synthetic traffic patterns on an 8×8 chip multiprocessor, compared to a baseline one-cycle router network. Full-system simulations with Splash-2 and Parsec benchmarks demonstrate 27 and 52 percent reduction in runtime for private and shared level-2 designs, respectively.
Keywords :
integrated circuit design; network routing; network-on-chip; Parsec benchmarks; Smart on-chip network; Splash-2; chip multiprocessor; flow-control technique; low-load latencies; microarchitecture links; multihop paths; one-cycle router network; place-and-route design; router design; runtime reduction; shared network on chip; single-cycle datapath; single-cycle multihop asynchronous repeated traversal; synthetic traffic patterns; Delays; Digital communication; Ports (Computers); Repeaters; Spread spectrum communication; System-on-chip; Delays; Digital communication; Ports (Computers); Repeaters; Spread spectrum communication; System-on-chip; high performance computing; multicore processor; network on chip; networking; repeated link; virtualization;
fLanguage :
English
Journal_Title :
Micro, IEEE
Publisher :
ieee
ISSN :
0272-1732
Type :
jour
DOI :
10.1109/MM.2014.48
Filename :
6828577
Link To Document :
بازگشت