DocumentCode :
2523287
Title :
Locality vs. Balance: Exploring Data Mapping Policies on NUMA Systems
Author :
Diener, Matthias ; Cruz, Eduardo H. M. ; Navaux, Philippe O. A.
Author_Institution :
Inf. Inst., Fed. Univ. of Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
fYear :
2015
fDate :
4-6 March 2015
Firstpage :
9
Lastpage :
16
Abstract :
In parallel architectures that have a Non-Uniform Memory Access (NUMA) behavior, the mapping of memory pages to NUMA nodes influences the performance of parallel applications. In order to improve traditional data mapping policies, two basic strategies can be employed: optimizing locality or balance of memory accesses. In a locality-based policy, memory pages are mapped to nodes that access the page the most. In a balance-based policy, memory pages are mapped such that the number of memory accesses resolved by each memory controller is similar. In this paper, we perform an in-depth exploration of these data mapping policies on the performance of parallel applications. We introduce metrics that describe their memory access behavior and evaluate their suitability for data mapping. We also present new mapping policies that focus on locality, balance or both. These policies were evaluated on three different NUMA architectures with applications from the NAS-OMP and PARSEC benchmark suites. Results show that the performance improvements of each policy depend on the characteristics of the applications and machines. Choosing the wrong policy can actually hurt the performance compared to the default first-touch mapping. Compared to traditional mapping policies and to policies that only focus on either locality or balance, taking into account both locality and balance results in the highest improvements. Furthermore, it avoids the performance reduction caused by the wrong data mapping.
Keywords :
data handling; parallel architectures; NUMA nodes; NUMA systems; data mapping; data mapping policies; first-touch mapping; memory controller; memory pages; nonuniform memory access; parallel applications; parallel architectures; Benchmark testing; Instruction sets; Linux; Mathematical model; Measurement; Memory management; Data mapping; Load balance; Locality; NUMA;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on
Conference_Location :
Turku
ISSN :
1066-6192
Type :
conf
DOI :
10.1109/PDP.2015.11
Filename :
7092693
Link To Document :
بازگشت