• DocumentCode
    1190961
  • Title

    Switch MSHR: a technique to reduce remote read memory access time in CC-NUMA multiprocessors

  • Author

    Bhuyan, Laxmi Narayan ; Wang, Hujun

  • Author_Institution
    Comput. Sci. & Eng. Dept., California Univ., Riverside, CA, USA
  • Volume
    52
  • Issue
    5
  • fYear
    2003
  • fDate
    5/1/2003 12:00:00 AM
  • Firstpage
    617
  • Lastpage
    632
  • Abstract
    A remote memory access poses a severe problem for the design of CC-NUMA multiprocessors because it takes an order of magnitude longer than the local memory access. The large latency arises partly due to the increased distance between the processor and remote memory over the interconnection network. In this paper, we develop a new switch architecture, called Switch MSHR (SMSHR), which provides the cache block to the requesting processors without those requests having to go to the home memory. The SMSHR idea is based on providing a few miss status holding registers (MSHRs) in each switch that keep track of read requests to the memory. The SMSHR blocks secondary requests to the same memory block and provides them with a copy of the block when the primary reply returns. The SMSHR design is then extended to include a switch cache, which can temporarily save a copy of the data block for later use. We provide basic block designs for the SMSHR and SIVISHR+cache architectures in this paper. We explore the design space by modeling the new switch architectures in a detailed execution-driven simulator and analyze the performance benefits. Our Simulation results show that applications with a high degree of data sharing benefit tremendously from the SMSHR and SMSHR+cache techniques.
  • Keywords
    cache storage; memory architecture; multiprocessor interconnection networks; performance evaluation; virtual machines; CC-NUMA multiprocessors; Switch MSHR; cache block; data sharing; design space; execution-driven simulator; interconnection network; latency; miss status holding registers; modeling; performance benefits; remote memory access; remote read memory access time reduction; requesting processors; switch architecture; Analytical models; Bandwidth; Delay; Multiprocessor interconnection networks; Network servers; Performance analysis; Registers; Space exploration; Switches; Telecommunication traffic;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2003.1197128
  • Filename
    1197128