DocumentCode :
3539890
Title :
SCD: A scalable coherence directory with flexible sharer set encoding
Author :
Sanchez, Daniel ; Kozyrakis, Christos
fYear :
2012
fDate :
25-29 Feb. 2012
Firstpage :
1
Lastpage :
12
Abstract :
Large-scale CMPs with hundreds of cores require a directory-based protocol to maintain cache coherence. However, previously proposed coherence directories are hard to scale beyond tens of cores, requiring either excessive area or energy, complex hierarchical protocols, or inexact representations of sharer sets that increase coherence traffic and degrade performance. We present SCD, a scalable coherence directory that relies on efficient highly-associative caches (such as zcaches) to implement a single-level directory that scales to thousands of cores, tracks sharer sets exactly, and incurs negligible directory-induced invalidations. SCD scales because, unlike conventional directories, it uses a variable number of directory tags to represent sharer sets: lines with one or few sharers use a single tag, while widely shared lines use additional tags, so tags remain small as the system scales up. We show that, thanks to the efficient highly-associative array it relies on, SCD can be fully characterized using analytical models, and can be sized to guarantee a negligible number of evictions independently of the workload. We evaluate SCD using simulations of a 1024-core CMP. For the same level of coverage, we find that SCD is 13× more area-efficient than full-map sparse directories, and 2× more area-efficient and faster than hierarchical directories, while requiring a simpler protocol. Furthermore, we show that SCD´s analytical models are accurate in practice.
Keywords :
cache storage; encoding; microprocessor chips; multiprocessing systems; protocols; 1024-core CMP; SCD; analytical models; cache coherence; chip-multiprocessors; coherence traffic; complex hierarchical protocols; directory-based protocol; excessive area; excessive energy; flexible sharer set encoding; large-scale CMP; negligible directory-induced invalidations; scalable coherence directory; sharer set inexact representations; single-level directory; than full-map sparse directories; zcaches; Analytical models; Arrays; Coherence; Indexes; Organizations; Protocols; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on
Conference_Location :
New Orleans, LA
ISSN :
1530-0897
Print_ISBN :
978-1-4673-0827-4
Electronic_ISBN :
1530-0897
Type :
conf
DOI :
10.1109/HPCA.2012.6168950
Filename :
6168950
Link To Document :
بازگشت