• DocumentCode
    1478795
  • Title

    A Low-Overhead Asynchronous Interconnection Network for GALS Chip Multiprocessors

  • Author

    Horak, Michael N. ; Nowick, Steven M. ; Carlberg, Matthew ; Vishkin, Uzi

  • Author_Institution
    Adv. Simulation Technol., Inc., Herndon, VA, USA
  • Volume
    30
  • Issue
    4
  • fYear
    2011
  • fDate
    4/1/2011 12:00:00 AM
  • Firstpage
    494
  • Lastpage
    507
  • Abstract
    A new asynchronous interconnection network is introduced for globally-asynchronous locally-synchronous (GALS) chip multiprocessors. The network eliminates the need for global clock distribution, and can interface multiple synchronous timing domains operating at unrelated clock rates. In particular, two new highly-concurrent asynchronous components are introduced which provide simple routing and arbitration/merge functions. Post-layout simulations in identical commercial 90 nm technology indicate that comparable recent synchronous router nodes have 5.6-10.7 more energy per packet and 2.8-6.4 greater area than the new asynchronous nodes. Under random traffic, the network provides significantly lower latency and identical throughput over the entire operating range of the 800 MHz network and through mid-range traffic rates for the 1.36 GHz network, but with degradation at higher traffic rates. Preliminary evaluations are also presented for a mixed-timing (GALS) network in a shared-memory parallel architecture, running both random traffic and parallel benchmark kernels, as well as directions for further improvement.
  • Keywords
    circuit layout; clock distribution networks; microprocessor chips; multiprocessor interconnection networks; network routing; network-on-chip; parallel architectures; shared memory systems; GALS chip multiprocessor; clock distribution; frequency 1.36 GHz; frequency 800 MHz; globally-asynchronous locally-synchronous chip multiprocessor; interface multiple synchronous timing; low-overhead asynchronous interconnection network; mixed-timing network; network routing; parallel benchmark kernel; post-layout simulation; random traffic; shared-memory parallel architecture; size 90 nm; Latches; Multiprocessor interconnection; Pipelines; Registers; Routing; Throughput; Asynchronous design; GALS; low power; network-on-chip; parallel processing; transition-signaling;
  • fLanguage
    English
  • Journal_Title
    Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0278-0070
  • Type

    jour

  • DOI
    10.1109/TCAD.2011.2114970
  • Filename
    5737866