Title :
The Synchronization Power of Coalesced Memory Accesses
Author :
Phuong Hoai Ha ; Tsigas, Philippas ; Anshus, Otto J.
Author_Institution :
Dept. of Comput. Sci., Univ. of Tromso, Tromso, Norway
fDate :
7/1/2010 12:00:00 AM
Abstract :
Multicore architectures have established themselves as the new generation of computer architectures. As part of the one core to many cores evolution, memory access mechanisms have advanced rapidly. Several new memory access mechanisms have been implemented in many modern commodity multicore architectures. By specifying how processing cores access shared memory, memory access mechanisms directly influence the synchronization capabilities of multicore architectures. Therefore, it is crucial to investigate the synchronization power of these new memory access mechanisms. This paper investigates the synchronization power of coalesced memory accesses, a family of memory access mechanisms introduced in recent large multicore architectures such as the Compute Unified Device Architecture (CUDA). We first define three memory access models to capture the fundamental features of the new memory access mechanisms. Subsequently, we prove the exact synchronization power of these models in terms of their consensus numbers. These tight results show that the coalesced memory access mechanisms can facilitate strong synchronization between the threads of multicore architectures, without the need of synchronization primitives other than reads and writes. In the case of the contemporary CUDA processors, our results imply that the coalesced memory access mechanisms have consensus numbers up to 64.
Keywords :
parallel architectures; shared memory systems; synchronisation; coalesced memory access; compute unified device architecture; computer architecture; contemporary CUDA processor; memory access mechanism; memory access model; multicore architecture; shared memory; synchronization power; Memory access models; consensus; interprocess synchronization.; multicore architectures;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2009.135