Title :
Relax-Miracle: GPU parallelization of semi-analytic fourier-domain solvers for earthquake modeling
Author :
Masuti, Sagar Shrishailappa ; Barbot, Sylvain ; Kapre, Nachiket
Author_Institution :
Earth Obs. of Singapore, Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Effective utilization of GPU processing capacity for scientific workloads is often limited by memory throughput and PCIe communication transfer times. This is particularly true for semi-analytic Fourier-domain computations in earthquake modeling (Relax) where operations on large-scale 3D data structures can require moving large volumes of data from storage to the compute in predictable but orthogonal access patterns. We show how to transform the computation to avoid PCIe transfers entirely by reconstructing the 3D data structures directly within the GPU global memory. We also consider arithmetic transformations that replace some communication-intensive 1D FFTs with simpler, data-parallel analytical solutions. Using our approach we are able to reduce computation times for a geophysical model of the 2012 Mw8.7 Wharton Basin earthquake from 2 hours down to 15 minutes (speedup of ≈8x) for grid sizes of 512-512-256 when comparing NVIDIA K20 with a 16-threaded Intel Xeon E5-2670 CPU (supported by Intel-MKL libraries). Our GPU-accelerated solution (called Relax-Miracle) also makes it possible to conduct Markov-Chain Monte-Carlo simulations using more than 1000 time-dependent models on 12 GPUs per single day of calculation, enhancing our ability to use such techniques for time-consuming data inversion and Bayesian inversion experiments.
Keywords :
Fourier analysis; Markov processes; Monte Carlo methods; data structures; earthquake engineering; geophysics computing; graphics processing units; parallel processing; 16-threaded Intel Xeon E5-2670 CPU; 1D FFTs; 3D data structures; Bayesian inversion experiments; GPU global memory; GPU parallelization; GPU processing capacity; GPU-accelerated solution; Intel-MKL libraries; Markov-Chain Monte-Carlo simulations; NVIDIA K20; PCIe communication transfer times; Relax-Miracle; Wharton Basin earthquake; arithmetic transformations; data inversion; data-parallel analytical solutions; earthquake modeling; geophysical model; memory throughput; orthogonal access patterns; scientific workloads; semianalytic Fourier-domain computations; semianalytic Fourier-domain solvers; Computational modeling; Earthquakes; Graphics processing units; Green´s function methods; Mathematical model; Stress; Three-dimensional displays;
Conference_Titel :
High Performance Computing (HiPC), 2014 21st International Conference on
Print_ISBN :
978-1-4799-5975-4
DOI :
10.1109/HiPC.2014.7116901