• DocumentCode
    1442690
  • Title
    A Direct Coherence Protocol for Many-Core Chip Multiprocessors
  • Author
    Ros, A. ; Acacio, M.E. ; Garcia, J.M.
  • Author_Institution
    Dept. de Ing. y Tecnol. de Comput., Univ. de Murcia, Murcia, Spain
  • Volume
    21
  • Issue
    12
  • fYear
    2010
  • Firstpage
    1779
  • Lastpage
    1792
  • Abstract
    Future many-core CMP designs that integrate tens of processor cores on-chip will be constrained by area and power. Area constraints make it impractical to use a bus or a crossbar as the on-chip interconnection network, so tiled CMPs organized around a direct interconnection network will probably be the architecture of choice. Power constraints make it impractical to rely on broadcasts (as, for example, Token-CMP does) or any other brute-force method for maintaining cache coherence, which is why directory-based cache coherence protocols are currently employed. Unfortunately, directory protocols introduce indirection to access directory information, which negatively impacts performance. In this work, we present DiCo-CMP, a novel cache coherence protocol especially suited to future many-core tiled CMP architectures. In DiCo-CMP, the task of storing up-to-date sharing information and ensuring ordered accesses for every memory block is assigned to the cache that must provide the block on a miss. Therefore, DiCo-CMP reduces the miss latency compared to a directory protocol by sending requests directly to the cache that provides the block on a cache miss. These latency reductions result in improvements in execution time of up to 6 percent, on average, over a directory protocol. In comparison with Token-CMP, our protocol sends only one request message for each cache miss and is thus able to reduce network traffic by 43 percent.
  • Keywords
    cache storage; coherence; microprocessor chips; multiprocessor interconnection networks; protocols; DiCo-CMP; area constraints; brute-force method; direct interconnection network; directory-based cache coherence protocols; many-core chip multiprocessors; many-core tiled CMP architectures; on-chip interconnection network; Token-CMP; Access protocols; Broadcasting; Coherence; Delay; Microprocessors; Multiprocessor interconnection networks; Network-on-a-chip; Telecommunication traffic; Many-core CMP; cache coherence protocol; direct coherence; indirection problem; on-chip network traffic
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1045-9219
  • Type
    jour
  • DOI
    10.1109/TPDS.2010.43
  • Filename
    5432165
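
The indirection problem described in the abstract can be illustrated with a toy message-count model. This is only a sketch under stated assumptions, not the authors' simulator or evaluation: it considers a single cache-to-cache miss where the block is held by a remote owner cache, assumes the requester in the direct scheme already knows that owner, and ignores acknowledgements and invalidations. The names `MissCost`, `directory_miss`, `direct_miss`, and `broadcast_miss` are hypothetical, and the model does not attempt to reproduce the 6 percent or 43 percent figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MissCost:
    hops_to_data: int      # network hops on the critical path until data arrives
    control_messages: int  # request/forward messages sent before the data reply


def directory_miss() -> MissCost:
    # Assumed path: requester -> home directory -> owner cache -> requester.
    # The stop at the home directory is the indirection.
    return MissCost(hops_to_data=3, control_messages=2)


def direct_miss() -> MissCost:
    # Assumed path: requester -> owner cache -> requester.
    # The owner keeps the sharing information, so no directory stop is needed.
    return MissCost(hops_to_data=2, control_messages=1)


def broadcast_miss(n_tiles: int) -> MissCost:
    # Assumed path: requester -> every other tile; the owner replies with data.
    if n_tiles < 2:
        raise ValueError("need at least two tiles")
    return MissCost(hops_to_data=2, control_messages=n_tiles - 1)


if __name__ == "__main__":
    n_tiles = 16  # illustrative tile count, not taken from the paper
    for name, cost in [
        ("directory", directory_miss()),
        ("direct", direct_miss()),
        ("broadcast", broadcast_miss(n_tiles)),
    ]:
        print(f"{name:>9}: {cost.hops_to_data} hops, "
              f"{cost.control_messages} control message(s)")
```

In this simplified model the direct scheme matches the broadcast scheme on critical-path hops while sending a single request, which is the trade-off the abstract attributes to DiCo-CMP.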