• DocumentCode
    2026486
  • Title

    Optimizing communications in multi-GPU Lattice Boltzmann simulations

  • Author

    Calore, Enrico ; Marchi, Davide ; Schifano, Sebastiano Fabio ; Tripiccione, Raffaele

  • Author_Institution
    Dipt. di Fis. e Sci. della Terra, Univ. di Ferrara, Ferrara, Italy
  • fYear
    2015
  • fDate
    20-24 July 2015
  • Firstpage
    55
  • Lastpage
    62
  • Abstract
    An increasingly large number of scientific applications run on large clusters based on GPU systems. In most cases the large scale parallelism of the applications uses MPI, widely recognized as the de-facto standard for building parallel applications, while several programming languages are used to express the parallelism available in the application and map it onto the parallel resources available on GPUs. Regular grids and stencil codes are used in a subset of these applications, often corresponding to computational “Grand Challenges”. One such class of applications are Lattice Boltzmann Methods (LB) used in computational fluid dynamics. The regular structure of LB algorithms makes them suitable for processor architectures with a large degree of parallelism like GPUs. Scalability of these applications on large clusters requires a careful design of processor-to-processor data communications, exploiting all possibilities to overlap communication and computation. This paper looks at these issues, considering as a use case a state-of-the-art two-dimensional LB model, that accurately reproduces the thermo-hydrodynamics of a 2D-fluid obeying the equation-of-state of a perfect gas. We study in details the interplay between data organization and data layout, data-communication options and overlapping of communication and computation. We derive partial models of some performance features and compare with experimental results for production-grade codes that we run on a large cluster of GPUs.
  • Keywords
    flow simulation; graphics processing units; lattice Boltzmann methods; 2D-fluid; GPU systems; MPI; communication optimization; computational fluid dynamics; data layout; data organization; equation-of-state; large-scale parallelism; multiGPU lattice Boltzmann simulations; overlap communication; overlap computation; parallel applications; parallel resources; perfect gas; processor architectures; processor-to-processor data communications; production-grade codes; programming languages; regular grids; scientific applications; stencil codes; thermo-hydrodynamics; two-dimensional LB model; Bandwidth; Graphics processing units; Lattices; Mathematical model; Sociology; Standards; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2015 International Conference on
  • Conference_Location
    Amsterdam
  • Print_ISBN
    978-1-4673-7812-3
  • Type

    conf

  • DOI
    10.1109/HPCSim.2015.7237021
  • Filename
    7237021