مرکز منطقه ای اطلاع رساني علوم و فناوري - VLSI micro-architectures for high-radix crossbar schedulers

DocumentCode :

2193143

Title :

VLSI micro-architectures for high-radix crossbar schedulers

Author :

Passas, Giorgos ; Katevenis, Manolis ; Pnevmatikatos, Dionisios

Author_Institution :

Inst. of Comput. Sci. (ICS), Found. for Res. & Technol.-Hellas (FORTH), Heraklion, Greece

fYear :

2011

fDate :

1-4 May 2011

Firstpage :

217

Lastpage :

224

Abstract :

We study the scaling of parallel-matching crossbar schedulers to radices above 100. First, we examine a traditional microarchitecture that implements the matching decision of each input and each output of the crossbar in a separate arbiter block and communicates the matching decisions between the input and the output arbiters through global point-to-point links. Using simple models and experimentation with 90nm CMOS layouts, we show that this architecture is expensive because the global point-to-point links take up O(N⁴) area, where N the radix of the crossbar. Next, by observing that the wiring of an arbiter fits in a minimal O(NlogN) area, we propose a novel microarchitecture that inverts the locality of wires by orthogonally interleaving the input with the output arbiters, thus lowering the wiring area of the scheduler down to O(N²log²N). Using this architecture, the scheduler for a radix-128 FIFO, VOQ, or 2-VC crossbar becomes gate limited, fitting in 3.6, 7.2, and 7.2mm² respectively, which is a 40, 50, and 70% improvement compared to the traditional. Moreover, the proposed schedulers find a new match in less than 10ns, thus allowing a minimum packet below 30Bytes at 24Gb/s line rate. Based on these findings, we conclude that crossbar schedulers are feasible even for radices above 100.

Keywords :

CMOS logic circuits; VLSI; asynchronous circuits; digital arithmetic; integrated circuit layout; multiprocessor interconnection networks; network-on-chip; 2-VC crossbar; CMOS layouts; VLSI microarchitecture; VOQ; arbiter block; high-radix crossbar scheduler; matching decision; orthogonal interleaver; parallel-matching crossbar scheduler; point to point links; radix-128 FIFO; size 90 nm; virtual channel; virtual output queues; Algorithm design and analysis; Layout; ASICs; Crossbar; Parallel Iterative Matching;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Networks on Chip (NoCS), 2011 Fifth IEEE/ACM International Symposium on

Conference_Location :

Pittsburgh, PA

Electronic_ISBN :

978-1-4503-0720-8

Type :

conf

Filename :

5948567

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2193143