DocumentCode
3244597
Title
Solving 2D Nonlinear Unsteady Convection-Diffusion Equations on Heterogenous Platforms with Multiple GPUs
Author
Yang, Canqun ; Ge, Zhen ; Chen, Juan ; Wang, Feng ; Du, Yunfei
Author_Institution
Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
fYear
2009
fDate
8-11 Dec. 2009
Firstpage
961
Lastpage
966
Abstract
Solving complex convection-diffusion equations is very important to many practical mathematical and physical problems. After the finite difference discretization, most of the time for equations solution is spent on sparse linear equation solvers. In this paper, our goal is to solve 2D Nonlinear Unsteady Convection-Diffusion Equations by accelerating an iterative algorithm named Jacobi-preconditioned QMRCGSTAB on a heterogenous platform, which is composed of a multi-core processor and multiple GPUs. Firstly, a basic implementation and evaluation for adapting the problem to this kind of platform is given. Then, we propose two optimization methods to improve the performance: kernel merging method and matrix boundary data processing. Our experimental evaluation on an AMD Opteron(tm) quad-core processor 2380 linked to an NVIDIA Tesla S1070 platform with four GPUs delivers the peak performance of 33 GFLOPS (double precision), which is a speedup of close to a factor 32 compared to the same problem running on 4 cores of the same CPU.
Keywords
computer graphics; coprocessors; finite difference methods; iterative methods; mathematical analysis; 2D nonlinear unsteady convection-diffusion equations; Jacobi preconditioned QMRCGSTAB; accelerating iterative algorithm; finite difference discretization; kernel merging method; matrix boundary data processing; multicore processor; multiple GPU; quad core processor; sparse linear equation solvers; Acceleration; Difference equations; Finite difference methods; Iterative algorithms; Jacobian matrices; Kernel; Merging; Multicore processing; Nonlinear equations; Optimization methods; GPU; NUCDE solver; PQMRCGSTAB; accelerate; heterogenous;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems (ICPADS), 2009 15th International Conference on
Conference_Location
Shenzhen
ISSN
1521-9097
Print_ISBN
978-1-4244-5788-5
Type
conf
DOI
10.1109/ICPADS.2009.76
Filename
5395290
Link To Document