Title :
Graphics Processing Unit Accelerated
Micromagnetic Solver
Author :
Li, Shaojing ; Livshitz, Boris ; Lomakin, Vitaliy
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California at San Diego, La Jolla, CA, USA
fDate :
6/1/2010 12:00:00 AM
Abstract :
An efficient micromagnetic solver running on graphics processing units (GPU) is demonstrated. The solver implements a nonuniform grid interpolation method (NGIM) to compute the superposition integral for the magnetostatic field with operations and memory requirements. The NGIM divides the computational domain into a hierarchy of boxes containing sources and observers, and it uses spatial interpolation from sparse nonuniform grids to achieve computational savings. Efficiency of the GPU solver is achieved by using coalesced memory accessing requiring arranging data in contiguous addresses, one-block-per-box computations with a block of threads handling an observation box to achieve the best utilization of the GPU threads, and on-fly computation of all grids and interpolation coefficients leading to reduced memory and increased speed. The GPU-CPU speed-ups are shown to be in the range 40-100 depending on the problem size and accuracy. A simple and inexpensive GPU is shown to handle efficiently problems comprising discretizations of more than 16 million of spins.
Keywords :
coprocessors; interpolation; magnetic fields; micromagnetics; parallel algorithms; physics computing; GPU threads; GPU-CPU speed-ups; accelerated O(N) micromagnetic solver; coalesced memory; computational domain; computational savings; contiguous addresses; data arrangement; graphics processing units; magnetostatic field; nonuniform grid interpolation method; observation box; one-block-per-box computations; sparse nonuniform grids; spatial interpolation; superposition integral; Acceleration; Character generation; Computational efficiency; Concurrent computing; Delay; Graphics processing unit; Interpolation; Micromagnetics; Programming environments; Fast algorithms; graphics processing unit; micromagnetics; parallel algorithms;
Journal_Title :
Magnetics, IEEE Transactions on
DOI :
10.1109/TMAG.2010.2043504