• DocumentCode
    1484618
  • Title

    Extending Magny-Cours Cache Coherence

  • Author

    Ros, Alberto ; Cuesta, Blas ; Fernández-Pascual, Ricardo ; Gómez, María E. ; Acacio, Manuel E. ; Robles, Antonio ; García, José M. ; Duato, José

  • Author_Institution
    Dept. de Informdtica de Sist. y Comput., Univ. Politec. de Valencia, València, Spain
  • Volume
    61
  • Issue
    5
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    593
  • Lastpage
    606
  • Abstract
    One cost-effective way to meet the increasing demand for larger high-performance shared-memory servers is to build clusters with off-the-shelf processors connected with low-latency point-to-point interconnections like HyperTransport. Unfortunately, HyperTransport addressing limitations prevent building systems with more than eight nodes. While the recent High-Node Count HyperTransport specification overcomes this limitation, recently launched twelve-core Magny-Cours processors have already inherited it and provide only 3 bits to encode the pointers used by the directory cache which they include to increase the scalability of their coherence protocol. In this work, we propose and develop an external device to extend the coherence domain of Magny-Cours processors beyond the 8-node limit while maintaining the advantages provided by the directory cache. Evaluation results for systems with up to 32 nodes show that the performance offered by our solution scales with the number of nodes, enhancing the directory cache effectiveness by filtering additional messages. Particularly, we reduce execution time by 47 percent in a 32-die system with respect to the 8-die Magny-Cours configuration.
  • Keywords
    Internet; cache storage; microprocessor chips; protocols; shared memory systems; Internet; arger high-performance shared-memory servers; coherence protocol; directory cache; hypertransport; low-latency point-to-point interconnections; magny-cours cache coherence; off-the-shelf processors; traffic filtering; twelve-core magny-cours processors; Coherence; Probes; Program processors; Proposals; Protocols; Scalability; Servers; High-performance computing; cache coherence; coherence extension; directory protocol; scalability; shared memory; traffic filtering.;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2011.65
  • Filename
    5740853