DocumentCode :
467552
Title :
Layout-Accurate Design and Implementation of a High-Throughput Interconnection Network for Single-Chip Parallel Processing
Author :
Balkan, Aydin O. ; Horak, Michael N. ; Qu, Gang ; Vishkin, Uzi
Author_Institution :
Univ. of Maryland Inst. for Adv. Comput. Studies (UMIACS), Baltimore
fYear :
2007
fDate :
22-24 Aug. 2007
Firstpage :
21
Lastpage :
28
Abstract :
A mesh of trees (MoT) on-chip interconnection network has been proposed recently to provide high throughput between memory units and processors for single-chip parallel processing (Balkan et al., 2006). In this paper, we report our findings in bringing this concept to silicon. Specifically, we conduct cycle-accurate Verilog simulations to verify the analytical results claimed in (Balkan et al., 2006). We synthesize and obtain the layout of the MoT interconnection networks of various sizes. To further improve throughput, we investigate different arbitration primitives to handle load and store, the two most common memory operations. We also study the use of pipeline registers in large networks when there are long wires. Simulation based on full network layout demonstrates that significant throughput improvement can be achieved over the original proposed MoT interconnection network. The importance of this work lies in its validation of performance features of the MoT interconnection network, as they were previously shown to be competitive with traditional network solutions. The MoT network is currently used in an eXplicit multi-threading (XMT) on-chip parallel processor, which is engineered to support parallel programming. In that context, a 32-terminal MoT network could support up to 512 on-chip XMT processors. Our 8-terminal network that could serve 8 processor clusters (or 128 total processors), was also accepted recently for fabrication.
Keywords :
hardware description languages; multi-threading; multiprocessor interconnection networks; parallel processing; MoT on-chip interconnection network; Verilog simulations; eXplicit multi-threading; layout-accurate design; mesh of trees on-chip interconnection network; on-chip XMT processors; on-chip parallel processor; parallel programming; pipeline registers; single-chip parallel processing; Analytical models; Hardware design languages; Multiprocessor interconnection networks; Network synthesis; Network-on-a-chip; Parallel processing; Pipelines; Registers; Silicon; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Interconnects, 2007. HOTI 2007. 15th Annual IEEE Symposium on
Conference_Location :
Stanford, CA
ISSN :
1550-4794
Print_ISBN :
978-0-7695-2979-0
Type :
conf
DOI :
10.1109/HOTI.2007.11
Filename :
4296804
Link To Document :
بازگشت