DocumentCode :
2109005
Title :
Laziest write posting on bus-based SMPs
Author :
Lee, Yongjoon
Author_Institution :
New York City Tech. Coll., Brooklyn, NY, USA
Volume :
1
fYear :
2000
fDate :
2000
Firstpage :
21
Abstract :
Private caches are critical components for hiding memory access latency in high performance multiprocessor systems. However, it has been found that, when executing a parallel program, multiple processors may concurrently update a distinct portion of a cache line and cause unnecessary cache invalidation under traditional cache coherence protocols. Such invalidation can be delayed when software enforces a proper order of memory reads and writes using synchronization primitives. Although delaying cache invalidation until the next synchronization instruction avoids unnecessary coherence traffic, it still incurs additional overhead to invalidate and reconcile the inconsistent cache copies. In this paper, a deferred coherence model is presented, which extends the traditional coherence protocol with new partially-modified states to allow multiple writers to simultaneously update different portions of the same cache line. In addition, the proposed model separates the events of write notification and data reconciliation so that the updated data is posted only when another processor asks for the data. Furthermore, an efficient merging mechanism is incorporated to reconcile multiple inconsistent copies of a modified line upon accessing a potentially stale data. Execution-driven simulation of SPLASH-2 applications shows that the deferred coherence model can out-perform the traditional eager coherence model by up to 20%
Keywords :
cache storage; memory protocols; multiprocessing systems; performance evaluation; synchronisation; virtual machines; SPLASH-2 applications; bus-based symmetric multiprocessors; cache invalidation; coherence protocol; data reconciliation; deferred coherence model; execution-driven simulation; high performance multiprocessor systems; laziest write posting; memory access latency hiding; memory reads; memory writes; merging mechanism; parallel program; partially-modified states; private caches; software; synchronization primitives; write notification; Access protocols; Cities and towns; Coherence; Delay; Educational institutions; Microprocessors; Read-write memory; Software maintenance; Switched-mode power supply; System buses;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 2000 Canadian Conference on
Conference_Location :
Halifax, NS
ISSN :
0840-7789
Print_ISBN :
0-7803-5957-7
Type :
conf
DOI :
10.1109/CCECE.2000.849663
Filename :
849663
Link To Document :
بازگشت