• DocumentCode
    704173
  • Title

    Lattice Boltzmann Simulations at Petascale on Multi-GPU Systems with Asynchronous Data Transfer and Strictly Enforced Memory Read Alignment

  • Author

    Robertsen, Fredrik ; Westerholm, Jan ; Mattila, Keijo

  • Author_Institution
    Dept. of Inf. Technol., Abo Akademi Univ., Abo, Finland
  • fYear
    2015
  • fDate
    4-6 March 2015
  • Firstpage
    604
  • Lastpage
    609
  • Abstract
    The lattice Boltzmann method is a well-established numerical approach for complex fluid flow simulations. Recently general-purpose graphics processing units have become accessible as high-performance computing resources at large-scale. We report on implementing a lattice Boltzmann solver for multi-GPU systems that achieves 0.69 PFLOPS performance on 16384 GPUs. In addition to optimizing the data layout on the GPUs and eliminating the halo sites, we make use of the possibility to overlap data transfer between the host CPU and the device GPU with computing on the GPU. We simulate flow in porous media and measure both strong and weak scaling performance with the emphasis being on a large scale simulation using realistic input data.
  • Keywords
    flow simulation; graphics processing units; lattice Boltzmann methods; parallel processing; Petascale; asynchronous data transfer; complex fluid flow simulations; data layout; general-purpose graphics processing units; high-performance computing; lattice Boltzmann simulations; memory read alignment; multiGPU system; Bandwidth; Computational modeling; Data transfer; Graphics processing units; Kernel; Lattice Boltzmann methods; GPU; Lattice Boltzmann; Titan; asynchronous communication; memory alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on
  • Conference_Location
    Turku
  • ISSN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2015.71
  • Filename
    7092782