• DocumentCode
    2193143
  • Title

    VLSI micro-architectures for high-radix crossbar schedulers

  • Author

    Passas, Giorgos ; Katevenis, Manolis ; Pnevmatikatos, Dionisios

  • Author_Institution
    Inst. of Comput. Sci. (ICS), Found. for Res. & Technol.-Hellas (FORTH), Heraklion, Greece
  • fYear
    2011
  • fDate
    1-4 May 2011
  • Firstpage
    217
  • Lastpage
    224
  • Abstract
    We study the scaling of parallel-matching crossbar schedulers to radices above 100. First, we examine a traditional microarchitecture that implements the matching decision of each input and each output of the crossbar in a separate arbiter block and communicates the matching decisions between the input and the output arbiters through global point-to-point links. Using simple models and experimentation with 90nm CMOS layouts, we show that this architecture is expensive because the global point-to-point links take up O(N4) area, where N the radix of the crossbar. Next, by observing that the wiring of an arbiter fits in a minimal O(NlogN) area, we propose a novel microarchitecture that inverts the locality of wires by orthogonally interleaving the input with the output arbiters, thus lowering the wiring area of the scheduler down to O(N2log2N). Using this architecture, the scheduler for a radix-128 FIFO, VOQ, or 2-VC crossbar becomes gate limited, fitting in 3.6, 7.2, and 7.2mm2 respectively, which is a 40, 50, and 70% improvement compared to the traditional. Moreover, the proposed schedulers find a new match in less than 10ns, thus allowing a minimum packet below 30Bytes at 24Gb/s line rate. Based on these findings, we conclude that crossbar schedulers are feasible even for radices above 100.
  • Keywords
    CMOS logic circuits; VLSI; asynchronous circuits; digital arithmetic; integrated circuit layout; multiprocessor interconnection networks; network-on-chip; 2-VC crossbar; CMOS layouts; VLSI microarchitecture; VOQ; arbiter block; high-radix crossbar scheduler; matching decision; orthogonal interleaver; parallel-matching crossbar scheduler; point to point links; radix-128 FIFO; size 90 nm; virtual channel; virtual output queues; Algorithm design and analysis; Layout; ASICs; Crossbar; Parallel Iterative Matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networks on Chip (NoCS), 2011 Fifth IEEE/ACM International Symposium on
  • Conference_Location
    Pittsburgh, PA
  • Electronic_ISBN
    978-1-4503-0720-8
  • Type

    conf

  • Filename
    5948567