DocumentCode :
723688
Title :
Compiler-Directed Transformation for Higher-Order Stencils
Author :
Basu, Protonu ; Hall, Mary ; Williams, Samuel ; Van Straalen, Brian ; Oliker, Leonid ; Colella, Phillip
Author_Institution :
Sch. of Comput., Univ. of Utah, Salt Lake City, UT, USA
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
313
Lastpage :
323
Abstract :
As the cost of data movement increasingly dominates performance, developers of finite-volume and finite-difference solutions for partial differential equations (PDEs) are exploring novel higher-order stencils that increase numerical accuracy and computational intensity. This paper describes a new compiler reordering transformation applied to stencil operators that performs partial sums in buffers, and reuses the partial sums in computing multiple results. This optimization has multiple effect son improving stencil performance that are particularly important to higher-order stencils: exploits data reuse, reduces floating-point operations, and exposes efficient SIMD parallelism to backend compilers. We study the benefit of this optimization in the context of Geometric Multigrid (GMG), a widely used method to solve PDEs, using four different Jacobi smoothers built from 7-, 13-, 27- and 125-point stencils. We quantify performance, speedup, and numerical accuracy, and use the Roofline model to qualify our results. Ultimately, we obtain over 4× speedup on the smoothers themselves and up to a 3× speedup on the multigrid solver. Finally, we demonstrate that high-order multigrid solvers have the potential of reducing total data movement and energy by several orders of magnitude.
Keywords :
finite difference methods; finite volume methods; grid computing; optimising compilers; partial differential equations; GMG; Jacobi smoothers; PDEs; Roofline model; SIMD parallelism; backend compilers; compiler optimization; compiler reordering transformation; compiler-directed transformation; computational intensity; data movement cost; finite-difference solutions; finite-volume solutions; floating-point operations; geometric multigrid optimization; high-order multigrid solvers; higher-order stencils; numerical accuracy; partial differential equations; Accuracy; Approximation methods; Arrays; Jacobian matrices; Optimization; Poisson equations; Three-dimensional displays; Compiler Optimization; High-Order; Mehrstellen; Multigrid; Stencil;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location :
Hyderabad
ISSN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2015.103
Filename :
7161520
Link To Document :
بازگشت