• DocumentCode
    246572
  • Title

    Cache Coherence Method for Improving Multi-threaded Applications on Multicore Systems

  • Author

    Sun Sun ; Hong An ; Junshi Chen

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2014
  • fDate
    20-23 Dec. 2014
  • Firstpage
    47
  • Lastpage
    50
  • Abstract
    Chip-multiprocessors (CMPs) have become the mainstream parallel architecture in enterprise and scientific computing facilities. For scalability reasons, design with larger core counts tends towards with physically distributed hardware caches. This naturally results in a Non-Uniform Cache Access design, where data movement and management impacts access latency and consume power. In this work, we observed that shared data writing behavior dramatically wastes precious on-chip hardware cache resource and seriously affects the whole system performance due to the high remote access latency. Therefore, we propose a new prediction mechanism to predict the impact of shared data and a directory-based MESI cache coherence protocol with selective write-shared-data-update transition strategy instead of native write-invalidate strategy. We evaluate our proposal on a modern multi-core machine with NAS Parallel Benchmarks. Experimental results showed speedup gains of up to 21% opposed to the native write-invalidate transition strategy.
  • Keywords
    cache storage; multi-threading; multiprocessing systems; parallel architectures; CMP; NAS Parallel Benchmarks; chip-multiprocessors; data management; data movement; directory-based MESI cache coherence protocol; enterprise computing facilities; mainstream parallel architecture; multicore machine; nonuniform cache access design; on-chip hardware cache resource; physically distributed hardware caches; remote access latency; scientific computing facilities; shared data writing behavior; write-invalidate strategy; write-shared-data-update transition strategy; Benchmark testing; Coherence; Instruction sets; Multicore processing; Protocols; Radiation detectors; Cache coherence protocol; Chip-multiprocessors; Non-Uniform cache access; Shared data; Write invalidate;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia, Computer Graphics and Broadcasting (MulGraB), 2014 6th International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-1-4799-7763-5
  • Type

    conf

  • DOI
    10.1109/MulGraB.2014.18
  • Filename
    7024316