• DocumentCode
    2572924
  • Title

    The use of prediction for accelerating upgrade misses in cc-NUMA multiprocessors

  • Author

    Acacio, Manuel E. ; Gonzalez, Jose ; García, José M. ; Duato, José

  • Author_Institution
    Univ. de Murcia, Spain
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    155
  • Lastpage
    164
  • Abstract
    This work is focused on accelerating upgrade misses in cc-NUMA multiprocessors. These misses are caused by store instructions for which a read-only copy of the line is found in the L2 cache. Upgrade misses require a message sent from the missing node to the directory, a directory lookup in order to find the set of sharers, invalidation messages being sent to the sharers and responses to the invalidations being sent back. Therefore, the penalty paid by these misses is not negligible, mainly if we consider that they account for a high percentage of the total miss rate. We propose the use of prediction as a means of providing cc-NUMA multiprocessors with a more efficient support for upgrade misses by directly invalidating sharers from the missing node. Our proposal comprises an effective prediction scheme achieving high hit rates as well as a coherence protocol extended to support the use of prediction. Our work is motivated by two key observations: first, upgrade misses present a repetitive behavior and, second, the total number of sharers being invalidated is small (one, in some cases). Using execution-driven simulations, we show that the use of prediction can significantly accelerate upgrade misses (latency reductions of more than 40% in some cases). These important improvements translate into speed-ups on application performance up to 14%. Finally, these results can be obtained including a predictor with a total size of less than 48 KB in every node.
  • Keywords
    cache storage; delays; memory protocols; shared memory systems; L2 cache; cc-NUMA multiprocessors; coherence protocol; direct invalidation; execution-driven simulations; latency reductions; prediction; repetitive behavior; sharers; upgrade miss acceleration; Acceleration; Access protocols; Coherence; Delay; Hardware; Out of order; Predictive models; Proposals;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Architectures and Compilation Techniques, 2002. Proceedings. 2002 International Conference on
  • ISSN
    1089-795X
  • Print_ISBN
    0-7695-1620-3
  • Type

    conf

  • DOI
    10.1109/PACT.2002.1106014
  • Filename
    1106014