Title :
Optimizing communications in multi-GPU Lattice Boltzmann simulations
Author :
Calore, Enrico ; Marchi, Davide ; Schifano, Sebastiano Fabio ; Tripiccione, Raffaele
Author_Institution :
Dipt. di Fis. e Sci. della Terra, Univ. di Ferrara, Ferrara, Italy
Abstract :
An increasingly large number of scientific applications run on large clusters based on GPU systems. In most cases the large scale parallelism of the applications uses MPI, widely recognized as the de-facto standard for building parallel applications, while several programming languages are used to express the parallelism available in the application and map it onto the parallel resources available on GPUs. Regular grids and stencil codes are used in a subset of these applications, often corresponding to computational “Grand Challenges”. One such class of applications are Lattice Boltzmann Methods (LB) used in computational fluid dynamics. The regular structure of LB algorithms makes them suitable for processor architectures with a large degree of parallelism like GPUs. Scalability of these applications on large clusters requires a careful design of processor-to-processor data communications, exploiting all possibilities to overlap communication and computation. This paper looks at these issues, considering as a use case a state-of-the-art two-dimensional LB model, that accurately reproduces the thermo-hydrodynamics of a 2D-fluid obeying the equation-of-state of a perfect gas. We study in details the interplay between data organization and data layout, data-communication options and overlapping of communication and computation. We derive partial models of some performance features and compare with experimental results for production-grade codes that we run on a large cluster of GPUs.
Keywords :
flow simulation; graphics processing units; lattice Boltzmann methods; 2D-fluid; GPU systems; MPI; communication optimization; computational fluid dynamics; data layout; data organization; equation-of-state; large-scale parallelism; multiGPU lattice Boltzmann simulations; overlap communication; overlap computation; parallel applications; parallel resources; perfect gas; processor architectures; processor-to-processor data communications; production-grade codes; programming languages; regular grids; scientific applications; stencil codes; thermo-hydrodynamics; two-dimensional LB model; Bandwidth; Graphics processing units; Lattices; Mathematical model; Sociology; Standards; Statistics;
Conference_Titel :
High Performance Computing & Simulation (HPCS), 2015 International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4673-7812-3
DOI :
10.1109/HPCSim.2015.7237021