• DocumentCode
    3539890
  • Title

    SCD: A scalable coherence directory with flexible sharer set encoding

  • Author

    Sanchez, Daniel ; Kozyrakis, Christos

  • fYear
    2012
  • fDate
    25-29 Feb. 2012
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    Large-scale CMPs with hundreds of cores require a directory-based protocol to maintain cache coherence. However, previously proposed coherence directories are hard to scale beyond tens of cores, requiring either excessive area or energy, complex hierarchical protocols, or inexact representations of sharer sets that increase coherence traffic and degrade performance. We present SCD, a scalable coherence directory that relies on efficient highly-associative caches (such as zcaches) to implement a single-level directory that scales to thousands of cores, tracks sharer sets exactly, and incurs negligible directory-induced invalidations. SCD scales because, unlike conventional directories, it uses a variable number of directory tags to represent sharer sets: lines with one or few sharers use a single tag, while widely shared lines use additional tags, so tags remain small as the system scales up. We show that, thanks to the efficient highly-associative array it relies on, SCD can be fully characterized using analytical models, and can be sized to guarantee a negligible number of evictions independently of the workload. We evaluate SCD using simulations of a 1024-core CMP. For the same level of coverage, we find that SCD is 13× more area-efficient than full-map sparse directories, and 2× more area-efficient and faster than hierarchical directories, while requiring a simpler protocol. Furthermore, we show that SCD´s analytical models are accurate in practice.
  • Keywords
    cache storage; encoding; microprocessor chips; multiprocessing systems; protocols; 1024-core CMP; SCD; analytical models; cache coherence; chip-multiprocessors; coherence traffic; complex hierarchical protocols; directory-based protocol; excessive area; excessive energy; flexible sharer set encoding; large-scale CMP; negligible directory-induced invalidations; scalable coherence directory; sharer set inexact representations; single-level directory; than full-map sparse directories; zcaches; Analytical models; Arrays; Coherence; Indexes; Organizations; Protocols; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1530-0897
  • Print_ISBN
    978-1-4673-0827-4
  • Electronic_ISBN
    1530-0897
  • Type

    conf

  • DOI
    10.1109/HPCA.2012.6168950
  • Filename
    6168950