DocumentCode :
1358875
Title :
The Synchronization Power of Coalesced Memory Accesses
Author :
Phuong Hoai Ha ; Tsigas, Philippas ; Anshus, Otto J.
Author_Institution :
Dept. of Comput. Sci., Univ. of Tromso, Tromso, Norway
Volume :
21
Issue :
7
fYear :
2010
fDate :
7/1/2010
Firstpage :
939
Lastpage :
953
Abstract :
Multicore architectures have established themselves as the new generation of computer architectures. As part of the evolution from one core to many cores, memory access mechanisms have advanced rapidly. Several new memory access mechanisms have been implemented in many modern commodity multicore architectures. By specifying how processing cores access shared memory, memory access mechanisms directly influence the synchronization capabilities of multicore architectures. Therefore, it is crucial to investigate the synchronization power of these new memory access mechanisms. This paper investigates the synchronization power of coalesced memory accesses, a family of memory access mechanisms introduced in recent large multicore architectures such as the Compute Unified Device Architecture (CUDA). We first define three memory access models to capture the fundamental features of the new memory access mechanisms. Subsequently, we prove the exact synchronization power of these models in terms of their consensus numbers. These tight results show that the coalesced memory access mechanisms can facilitate strong synchronization between the threads of multicore architectures, without the need for synchronization primitives other than reads and writes. In the case of the contemporary CUDA processors, our results imply that the coalesced memory access mechanisms have consensus numbers up to 64.
Keywords :
parallel architectures; shared memory systems; synchronisation; coalesced memory access; compute unified device architecture; computer architecture; contemporary CUDA processor; memory access mechanism; memory access model; multicore architecture; shared memory; synchronization power; Memory access models; consensus; interprocess synchronization; multicore architectures
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2009.135
Filename :
5226617