DocumentCode
723688
Title
Compiler-Directed Transformation for Higher-Order Stencils
Author
Basu, Protonu ; Hall, Mary ; Williams, Samuel ; Van Straalen, Brian ; Oliker, Leonid ; Colella, Phillip
Author_Institution
Sch. of Comput., Univ. of Utah, Salt Lake City, UT, USA
fYear
2015
fDate
25-29 May 2015
Firstpage
313
Lastpage
323
Abstract
As the cost of data movement increasingly dominates performance, developers of finite-volume and finite-difference solutions for partial differential equations (PDEs) are exploring novel higher-order stencils that increase numerical accuracy and computational intensity. This paper describes a new compiler reordering transformation applied to stencil operators that performs partial sums in buffers, and reuses the partial sums in computing multiple results. This optimization has multiple effect son improving stencil performance that are particularly important to higher-order stencils: exploits data reuse, reduces floating-point operations, and exposes efficient SIMD parallelism to backend compilers. We study the benefit of this optimization in the context of Geometric Multigrid (GMG), a widely used method to solve PDEs, using four different Jacobi smoothers built from 7-, 13-, 27- and 125-point stencils. We quantify performance, speedup, and numerical accuracy, and use the Roofline model to qualify our results. Ultimately, we obtain over 4× speedup on the smoothers themselves and up to a 3× speedup on the multigrid solver. Finally, we demonstrate that high-order multigrid solvers have the potential of reducing total data movement and energy by several orders of magnitude.
Keywords
finite difference methods; finite volume methods; grid computing; optimising compilers; partial differential equations; GMG; Jacobi smoothers; PDEs; Roofline model; SIMD parallelism; backend compilers; compiler optimization; compiler reordering transformation; compiler-directed transformation; computational intensity; data movement cost; finite-difference solutions; finite-volume solutions; floating-point operations; geometric multigrid optimization; high-order multigrid solvers; higher-order stencils; numerical accuracy; partial differential equations; Accuracy; Approximation methods; Arrays; Jacobian matrices; Optimization; Poisson equations; Three-dimensional displays; Compiler Optimization; High-Order; Mehrstellen; Multigrid; Stencil;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location
Hyderabad
ISSN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2015.103
Filename
7161520
Link To Document