• DocumentCode
    3326826
  • Title

    Managing shared last-level cache in a heterogeneous multicore processor

  • Author

    Gwangsun Kim ; Kim, John ; Jung Ho Ahn ; Jaeha Kim

  • Author_Institution
    KAIST, Daejeon, South Korea
  • fYear
    2013
  • fDate
    7-11 Sept. 2013
  • Firstpage
    225
  • Lastpage
    234
  • Abstract
    Memory bandwidth has been one of the most critical system performance bottlenecks. As a result, the HMC (Hybrid Memory Cube) has recently been proposed to improve DRAM bandwidth as well as energy efficiency. In this paper, we explore different system interconnect designs with HMCs. We show that processor-centric network architectures cannot fully utilize processor bandwidth across different traffic patterns. Thus, we propose a memory-centric network in which all processor channels are connected to HMCs and not to any other processors as all communication between processors goes through intermediate HMCs. Since there are multiple HMCs per processor, we propose a distributor-based network to reduce the network diameter and achieve lower latency while properly distributing the bandwidth across different routers and providing path diversity. Memory-centric networks lead to some challenges including higher processor-to-processor latency and the need to properly exploit the path diversity. We propose a pass-through microarchitecture, which, in combination with the proper intra-HMC organization, reduces the zero-load latency while exploiting adaptive (and non-minimal) routing to load-balance across different channels. Our results show that memory-centric networks can efficiently utilize processor bandwidth for different traffic patterns and achieve higher performance by providing higher memory bandwidth and lower latency.
  • Keywords
    DRAM chips; integrated circuit design; performance evaluation; DRAM bandwidth; HMC; energy efficiency; hybrid memory cubes; memory bandwidth; memory centric network; memory centric system interconnect design; network diameter; processor bandwidth; processor centric network architectures; processor channels; traffic patterns; Bandwidth; Organizations; Ports (Computers); Random access memory; Routing; Switches; Topology; cache management policy; heterogeneous multicores; shared last-level cache;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Architectures and Compilation Techniques (PACT), 2013 22nd International Conference on
  • Conference_Location
    Edinburgh
  • ISSN
    1089-795X
  • Print_ISBN
    978-1-4799-1018-2
  • Type

    conf

  • DOI
    10.1109/PACT.2013.6618812
  • Filename
    6618812