• DocumentCode
    2109005
  • Title

    Laziest write posting on bus-based SMPs

  • Author

    Lee, Yongjoon

  • Author_Institution
    New York City Tech. Coll., Brooklyn, NY, USA
  • Volume
    1
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    21
  • Abstract
    Private caches are critical components for hiding memory access latency in high performance multiprocessor systems. However, it has been found that, when executing a parallel program, multiple processors may concurrently update a distinct portion of a cache line and cause unnecessary cache invalidation under traditional cache coherence protocols. Such invalidation can be delayed when software enforces a proper order of memory reads and writes using synchronization primitives. Although delaying cache invalidation until the next synchronization instruction avoids unnecessary coherence traffic, it still incurs additional overhead to invalidate and reconcile the inconsistent cache copies. In this paper, a deferred coherence model is presented, which extends the traditional coherence protocol with new partially-modified states to allow multiple writers to simultaneously update different portions of the same cache line. In addition, the proposed model separates the events of write notification and data reconciliation so that the updated data is posted only when another processor asks for the data. Furthermore, an efficient merging mechanism is incorporated to reconcile multiple inconsistent copies of a modified line upon accessing a potentially stale data. Execution-driven simulation of SPLASH-2 applications shows that the deferred coherence model can out-perform the traditional eager coherence model by up to 20%
  • Keywords
    cache storage; memory protocols; multiprocessing systems; performance evaluation; synchronisation; virtual machines; SPLASH-2 applications; bus-based symmetric multiprocessors; cache invalidation; coherence protocol; data reconciliation; deferred coherence model; execution-driven simulation; high performance multiprocessor systems; laziest write posting; memory access latency hiding; memory reads; memory writes; merging mechanism; parallel program; partially-modified states; private caches; software; synchronization primitives; write notification; Access protocols; Cities and towns; Coherence; Delay; Educational institutions; Microprocessors; Read-write memory; Software maintenance; Switched-mode power supply; System buses;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2000 Canadian Conference on
  • Conference_Location
    Halifax, NS
  • ISSN
    0840-7789
  • Print_ISBN
    0-7803-5957-7
  • Type

    conf

  • DOI
    10.1109/CCECE.2000.849663
  • Filename
    849663