• DocumentCode
    3204875
  • Title

    Overlapping Computation and Communication for Advection on Hybrid Parallel Computers

  • Author

    White, J.B., III ; Dongarra, J.J.

  • Author_Institution
    Climate & Global Dynamics, Nat. Center for Atmos. Res., Boulder, CO, USA
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    59
  • Lastpage
    67
  • Abstract
    We describe computational experiments exploring the performance improvements from overlapping computation and communication on hybrid parallel computers. Our test case is explicit time integration of linear advection with constant uniform velocity in a three-dimensional periodic domain. The test systems include a Cray XT5, a Cray XE6, and two multicore Infiniband clusters with different generations of NVIDIA graphics processing units (GPUs). We describe results for Fortran implementations using various combinations of MPI, OpenMP, and CUDA, with and without overlap of computation and communication. We find that overlapping CPU computation, GPU computation, parallel communication, and CPU-GPU communication can provide performance improvements of more than a factor of two.
  • Keywords
    message passing; multiprocessing systems; parallel processing; CPU-GPU communication; CUDA; Cray XE6; Cray XT5; Fortran; MPI; NVIDIA GPU; OpenMP; graphics processing units; hybrid parallel computers; linear advection; multicore Infiniband clusters; Central Processing Unit; Computers; Equations; Graphics processing unit; Instruction sets; Kernel; Mathematical model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
  • Conference_Location
    Anchorage, AK
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-372-8
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.16
  • Filename
    6012825