• DocumentCode
    3588629
  • Title

    Directory Lookaside Table: Enabling scalable, low-conflict, many-core cache coherence directory

  • Author

    Xudong Shi ; Feiqi Su ; Jih-Kwon Peir

  • Author_Institution
    Google Inc., Mountain View, CA, USA
  • fYear
    2014
  • Firstpage
    111
  • Lastpage
    118
  • Abstract
    Maintaining hardware cache coherence on future CMPs becomes increasingly important and difficult as the number of cores keeps accelerating in mainstream multicore chips. The simple snooping-bus coherence scheme is not suitable due to its limited scalability. The sparse coherence directory approach may incur extra cache invalidations due to a topological mismatch between the coherence directory and the directories of all cache modules. In this paper, we propose an innovative CMP coherence directory that has three important properties. First, the directory has a simple set-associative design with small associativity. The number of directory entries matches the total number of cache blocks. Second, an augmented Directory Lookaside Table (DLT) allows blocks to be displaced from their primary sets in the coherence directory for alleviating hot-set conflicts. Third, to avoid expensive presence bits, each copy of a block along with the located core ID occupies a separate directory entry. Performance evaluations based on multithreaded and multi-programmed workloads demonstrate significant advantages of the proposed CMP directory over directories with traditional set-associative or skewed associative designs.
  • Keywords
    cache storage; multi-threading; multiprocessing systems; CMP coherence directory; DLT; directory lookaside table; hardware cache coherence; many-core cache coherence directory; multicore chip; multiprogrammed workload; multithreaded workload; set-associative design; skewed associative design; Arrays; Coherence; Indexes; Multicore processing; Organizations; Performance evaluation; Protocols; CMP; Cache coherence; Directory-based protocol; Snooping-bus protocol; Sparse directory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2014 20th IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/PADSW.2014.7097798
  • Filename
    7097798