DocumentCode
3588629
Title
Directory Lookaside Table: Enabling scalable, low-conflict, many-core cache coherence directory
Author
Xudong Shi ; Feiqi Su ; Jih-Kwon Peir
Author_Institution
Google Inc., Mountain View, CA, USA
fYear
2014
Firstpage
111
Lastpage
118
Abstract
Maintaining hardware cache coherence on future CMPs becomes increasingly important and difficult as the number of cores keeps accelerating in mainstream multicore chips. The simple snooping-bus coherence scheme is not suitable due to its limited scalability. The sparse coherence directory approach may incur extra cache invalidations due to a topological mismatch between the coherence directory and the directories of all cache modules. In this paper, we propose an innovative CMP coherence directory that has three important properties. First, the directory has a simple set-associative design with small associativity. The number of directory entries matches the total number of cache blocks. Second, an augmented Directory Lookaside Table (DLT) allows blocks to be displaced from their primary sets in the coherence directory for alleviating hot-set conflicts. Third, to avoid expensive presence bits, each copy of a block along with the located core ID occupies a separate directory entry. Performance evaluations based on multithreaded and multi-programmed workloads demonstrate significant advantages of the proposed CMP directory over directories with traditional set-associative or skewed associative designs.
Keywords
cache storage; multi-threading; multiprocessing systems; CMP coherence directory; DLT; directory lookaside table; hardware cache coherence; many-core cache coherence directory; multicore chip; multiprogrammed workload; multithreaded workload; set-associative design; skewed associative design; Arrays; Coherence; Indexes; Multicore processing; Organizations; Performance evaluation; Protocols; CMP; Cache coherence; Directory-based protocol; Snooping-bus protocol; Sparse directory;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems (ICPADS), 2014 20th IEEE International Conference on
Type
conf
DOI
10.1109/PADSW.2014.7097798
Filename
7097798
Link To Document