DocumentCode :
1685992
Title :
Architecture optimizations for synchronization and communication on chip multiprocessors
Author :
Fide, Sevin ; Jenks, Stephen
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Irvine, CA
fYear :
2008
Firstpage :
1
Lastpage :
8
Abstract :
Chip multiprocessors (CMPs) enable concurrent execution of multiple threads using several cores on a die. Current CMPs behave much like symmetric multiprocessors and do not take advantage of the proximity between cores to improve synchronization and communication between concurrent threads. Thread synchronization and communication instead use memory/cache interactions. We propose two architectural enhancements to support fine grain synchronization and communication between threads that reduce overhead and memory/cache contention. Register-based synchronization exploits the proximity between cores to provide low-latency shared registers for synchronization. This approach can save significant power over spin waiting when blocking events that suspend the core are used. Pre-pushing provides software controlled data forwarding between caches to reduce coherence traffic and improve cache latency and hit rates. We explore the behavior of these approaches, and evaluate their effectiveness at improving synchronization and communication performance on CMPs with private caches. Our simulation results show significant reduction in inter-core traffic, latencies, and miss rates.
Keywords :
cache storage; microprocessor chips; multi-threading; synchronisation; architecture optimizations; chip multiprocessors; communication performance; concurrent execution; low-latency shared registers; memory-cache interactions; multithreading; register-based synchronization; software controlled data forwarding; synchronization performance; Application software; Communication system traffic control; Computer architecture; Delay; Frequency synchronization; Hardware; Protocols; Registers; Traffic control; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
ISSN :
1530-2075
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2008.4536357
Filename :
4536357
Link To Document :
بازگشت