• DocumentCode
    244474
  • Title

    Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc)

  • Author

    Goglin, Brice

  • Author_Institution
    Inria Bordeaux - Sud-ouest, Univ. of Bordeaux, Talence, France
  • fYear
    2014
  • fDate
    21-25 July 2014
  • Firstpage
    74
  • Lastpage
    81
  • Abstract
    Modern computing platforms are increasingly complex, with multiple cores, shared caches, and NUMA architectures. Parallel applications developers have to take locality into account before they can expect good efficiency on these platforms. Thus there is a strong need for a portable tool gathering and exposing this information. The Hardware Locality project (hwloc) offers a tree representation of the hardware based on the inclusion and localities of the CPU and memory resources. It is already widely used for affinity-based task placement in high performance computing. In this article we present how hwloc is extended to describe more than computing and memory resources. Indeed, I/O device locality is becoming another important aspect of locality since high performance GPUs, network or InfiniBand interfaces possess privileged access to some of the cores and memory banks. hwloc integrates this knowledge into its topology representation and offers an interoperability API to extend existing libraries such as CUDA with locality information. We also describe how hwloc now helps process managers and batch schedulers to deal with the topology of multiple cluster nodes, together with compression for better scalability up to thousands of nodes.
  • Keywords
    multiprocessing systems; parallel processing; CPU resource; CUDA; GPU; I/O device locality; InfiniBand interfaces; NUMA architecture; compute unified device architecture; graphics processing unit; hardware locality; heterogeneous cluster nodes; high performance computing; hwloc management; input-output device; locality information; memory resource; network interface; parallel applications; Graphics processing units; Hardware; Libraries; Performance evaluation; Sockets; Topology; I/O devices; affinities; clusters; hwloc; locality; topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2014 International Conference on
  • Conference_Location
    Bologna
  • Print_ISBN
    978-1-4799-5312-7
  • Type

    conf

  • DOI
    10.1109/HPCSim.2014.6903671
  • Filename
    6903671