Title :
Optimizing thread-to-core mapping on manycore platforms with distributed Tag Directories
Author :
Guantao Liu ; Schmidt, Tim ; Domer, Rainer ; Dingankar, Ajit ; Kirkpatrick, Desmond
Author_Institution :
Univ. of California, Irvine, Irvine, CA, USA
Abstract :
With the increasing demand for parallel computing power, manycore platforms are attracting more and more attention due to their potential to improve performance and scalability of parallel applications. However, as the core count increases, core-to-core communication becomes expensive. For manycore architectures using directory-based cache coherence protocols, the core-to-core communication latency depends not only on the physical placement on the chip, but also on the location of the distributed cache tag directory. In this paper, we first define the concept of core distance for multicore and manycore architectures. Using a ping-pong spin-lock benchmark, we quantify the core distance on a ring-network platform and propose an approach to optimize thread-to-core mapping in order to minimize on-chip communication overhead. In our experiments, our approach speeds up communication-intensive benchmarks by more than 25% on average over the Linux default mapping strategy.
Keywords :
cache storage; multi-threading; multiprocessing systems; parallel architectures; communication-intensive benchmarks; core distance; core-to-core communication latency; directory-based cache coherence protocols; distributed cache tag directory; distributed tag directories; manycore architectures; manycore platforms; multicore architecture; on-chip communication overhead; parallel computing power; physical placement; ping-pong spin-lock benchmark; ring-network platform; thread-to-core mapping optimization; Benchmark testing; Coprocessors; Instruction sets; Multicore processing; Optimization;
Conference_Titel :
Design Automation Conference (ASP-DAC), 2015 20th Asia and South Pacific
Conference_Location :
Chiba
Print_ISBN :
978-1-4799-7790-1
DOI :
10.1109/ASPDAC.2015.7059044