• DocumentCode
    71620
  • Title

    Efficient Parallel GPU Design on WRF Five-Layer Thermal Diffusion Scheme

  • Author

    Melin Huang ; Bormin Huang ; Yang-Lang Chang ; Mielikainen, Jarno ; Huang, Hung-Lung Allen ; Goldberg, Mitchell D.

  • Author_Institution
    Space Sci. & Eng. Center, Univ. of Wisconsin-Madison, Madison, WI, USA
  • Volume
    8
  • Issue
    5
  • fYear
    2015
  • fDate
    May-15
  • Firstpage
    2249
  • Lastpage
    2259
  • Abstract
    Satellite remote-sensing observations and ground-based radar can detect the weather conditions from a distance and are widely used to monitor the weather all around the globe. The assimilated satellite/radar data are passed through the weather models for weather forecasting. The five-layer thermal diffusion scheme is one of the weather models, handling with an energy budget made up of sensible, latent, and radiative heat fluxes. The model feature of no interactions among horizontal grid points makes this scheme very favorable for parallel processing. This study demonstrates implementation of this scheme using graphics processing unit (GPU) massively parallel architecture. By employing one NVIDIA Tesla K40 GPU, our GPU optimization effort on this scheme achieves a speedup of 311× with respect to its CPU counterpart Fortran code running on one CPU core of Intel Xeon E5-2603, whereas the speedup for one CPU socket (four cores) with respect to one CPU core is only 3.1×. We can even boost the speedup of this scheme to 398× with respect to one CPU core when two NVIDIA Tesla K40 GPUs are applied.
  • Keywords
    data assimilation; geophysics computing; graphics processing units; meteorological radar; parallel architectures; remote sensing by radar; thermal diffusion; weather forecasting; CPU core; CPU socket; Fortran code; GPU optimization; Intel Xeon E5-2603; NVIDIA Tesla K40 GPU; WRF five-layer thermal diffusion scheme; Weather Research and Forecasting; energy budget; graphics processing unit; ground-based radar; horizontal grid point; parallel GPU design; parallel architecture; parallel processing; radar data assimilation; radiative heat flux; satellite remote-sensing observation; weather condition; weather forecasting; weather model; weather monitoring; Computational modeling; Graphics processing units; Heating; Instruction sets; Land surface temperature; Ocean temperature; Soil; Compute unified device architecture (CUDA); five-layer thermal diffusion; graphics processing unit (GPU); heat flux; radiation;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1939-1404
  • Type

    jour

  • DOI
    10.1109/JSTARS.2015.2422268
  • Filename
    7110533