• DocumentCode
    2523287
  • Title

    Locality vs. Balance: Exploring Data Mapping Policies on NUMA Systems

  • Author

    Diener, Matthias ; Cruz, Eduardo H. M. ; Navaux, Philippe O. A.

  • Author_Institution
    Inf. Inst., Fed. Univ. of Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
  • fYear
    2015
  • fDate
    4-6 March 2015
  • Firstpage
    9
  • Lastpage
    16
  • Abstract
    In parallel architectures that have a Non-Uniform Memory Access (NUMA) behavior, the mapping of memory pages to NUMA nodes influences the performance of parallel applications. In order to improve traditional data mapping policies, two basic strategies can be employed: optimizing locality or balance of memory accesses. In a locality-based policy, memory pages are mapped to nodes that access the page the most. In a balance-based policy, memory pages are mapped such that the number of memory accesses resolved by each memory controller is similar. In this paper, we perform an in-depth exploration of these data mapping policies on the performance of parallel applications. We introduce metrics that describe their memory access behavior and evaluate their suitability for data mapping. We also present new mapping policies that focus on locality, balance or both. These policies were evaluated on three different NUMA architectures with applications from the NAS-OMP and PARSEC benchmark suites. Results show that the performance improvements of each policy depend on the characteristics of the applications and machines. Choosing the wrong policy can actually hurt the performance compared to the default first-touch mapping. Compared to traditional mapping policies and to policies that only focus on either locality or balance, taking into account both locality and balance results in the highest improvements. Furthermore, it avoids the performance reduction caused by the wrong data mapping.
  • Keywords
    data handling; parallel architectures; NUMA nodes; NUMA systems; data mapping; data mapping policies; first-touch mapping; memory controller; memory pages; nonuniform memory access; parallel applications; parallel architectures; Benchmark testing; Instruction sets; Linux; Mathematical model; Measurement; Memory management; Data mapping; Load balance; Locality; NUMA;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on
  • Conference_Location
    Turku
  • ISSN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2015.11
  • Filename
    7092693