DocumentCode
2174496
Title
hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
Author
Broquedis, François ; Clet-Ortega, Jérôme ; Moreaud, Stéphanie ; Furmento, Nathalie ; Goglin, Brice ; Mercier, Guillaume ; Thibault, Samuel ; Namyst, Raymond
Author_Institution
LaBRI, Univ. of Bordeaux, Talence, France
fYear
2010
fDate
17-19 Feb. 2010
Firstpage
180
Lastpage
186
Abstract
The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior according to the underlying hierarchy of hardware resources and their software affinities. We introduce the Hardware Locality (hwloc) software which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in a abstracted and portable hierarchical manner. hwloc may significantly help performance by having runtime systems place their tasks or adapt their communication strategies depending on hardware affinities. We show that hwloc can already be used by popular high-performance OpenMP or MPI software. Indeed, scheduling OpenMP threads according to their affinities or placing MPI processes according to their communication patterns shows interesting performance improvement thanks to hwloc. An optimized MPI communication strategy may also be dynamically chosen according to the location of the communicating processes in the machine and its hardware characteristics.
Keywords
application program interfaces; message passing; multi-threading; scheduling; MPI software; OpenMP thread scheduling; complex hardware topology; hardware affinities management; hardware locality software; high-performance computing; hwloc; memory nodes; multicore processor; runtime system; shared caches; software affinity; Application software; Bandwidth; Computer architecture; Concurrent computing; Conference management; Hardware; Memory management; Software libraries; Topology; Yarn; Hardware Topology Affinities Placement MPI OpenMP;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel, Distributed and Network-Based Processing (PDP), 2010 18th Euromicro International Conference on
Conference_Location
Pisa
ISSN
1066-6192
Print_ISBN
978-1-4244-5672-7
Electronic_ISBN
1066-6192
Type
conf
DOI
10.1109/PDP.2010.67
Filename
5452445
Link To Document