DocumentCode
467552
Title
Layout-Accurate Design and Implementation of a High-Throughput Interconnection Network for Single-Chip Parallel Processing
Author
Balkan, Aydin O. ; Horak, Michael N. ; Qu, Gang ; Vishkin, Uzi
Author_Institution
Univ. of Maryland Inst. for Adv. Comput. Studies (UMIACS), Baltimore
fYear
2007
fDate
22-24 Aug. 2007
Firstpage
21
Lastpage
28
Abstract
A mesh of trees (MoT) on-chip interconnection network has been proposed recently to provide high throughput between memory units and processors for single-chip parallel processing (Balkan et al., 2006). In this paper, we report our findings in bringing this concept to silicon. Specifically, we conduct cycle-accurate Verilog simulations to verify the analytical results claimed in (Balkan et al., 2006). We synthesize and obtain the layout of the MoT interconnection networks of various sizes. To further improve throughput, we investigate different arbitration primitives to handle load and store, the two most common memory operations. We also study the use of pipeline registers in large networks when there are long wires. Simulation based on full network layout demonstrates that significant throughput improvement can be achieved over the original proposed MoT interconnection network. The importance of this work lies in its validation of performance features of the MoT interconnection network, as they were previously shown to be competitive with traditional network solutions. The MoT network is currently used in an eXplicit multi-threading (XMT) on-chip parallel processor, which is engineered to support parallel programming. In that context, a 32-terminal MoT network could support up to 512 on-chip XMT processors. Our 8-terminal network that could serve 8 processor clusters (or 128 total processors), was also accepted recently for fabrication.
Keywords
hardware description languages; multi-threading; multiprocessor interconnection networks; parallel processing; MoT on-chip interconnection network; Verilog simulations; eXplicit multi-threading; layout-accurate design; mesh of trees on-chip interconnection network; on-chip XMT processors; on-chip parallel processor; parallel programming; pipeline registers; single-chip parallel processing; Analytical models; Hardware design languages; Multiprocessor interconnection networks; Network synthesis; Network-on-a-chip; Parallel processing; Pipelines; Registers; Silicon; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
High-Performance Interconnects, 2007. HOTI 2007. 15th Annual IEEE Symposium on
Conference_Location
Stanford, CA
ISSN
1550-4794
Print_ISBN
978-0-7695-2979-0
Type
conf
DOI
10.1109/HOTI.2007.11
Filename
4296804
Link To Document