DocumentCode :
3244597
Title :
Solving 2D Nonlinear Unsteady Convection-Diffusion Equations on Heterogenous Platforms with Multiple GPUs
Author :
Yang, Canqun ; Ge, Zhen ; Chen, Juan ; Wang, Feng ; Du, Yunfei
Author_Institution :
Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2009
fDate :
8-11 Dec. 2009
Firstpage :
961
Lastpage :
966
Abstract :
Solving complex convection-diffusion equations is important to many practical problems in mathematics and physics. After finite difference discretization, most of the solution time is spent in sparse linear equation solvers. In this paper, we solve 2D nonlinear unsteady convection-diffusion equations by accelerating an iterative algorithm, Jacobi-preconditioned QMRCGSTAB, on a heterogenous platform composed of a multi-core processor and multiple GPUs. First, we give a basic implementation and evaluation of adapting the problem to this kind of platform. Then, we propose two optimization methods to improve performance: a kernel merging method and matrix boundary data processing. Our experimental evaluation on an AMD Opteron(tm) quad-core processor 2380 linked to an NVIDIA Tesla S1070 platform with four GPUs delivers a peak performance of 33 GFLOPS (double precision), a speedup of close to a factor of 32 compared to the same problem running on 4 cores of the same CPU.
Keywords :
computer graphics; coprocessors; finite difference methods; iterative methods; mathematical analysis; 2D nonlinear unsteady convection-diffusion equations; Jacobi preconditioned QMRCGSTAB; accelerating iterative algorithm; finite difference discretization; kernel merging method; matrix boundary data processing; multicore processor; multiple GPU; quad core processor; sparse linear equation solvers; Acceleration; Difference equations; Finite difference methods; Iterative algorithms; Jacobian matrices; Kernel; Merging; Multicore processing; Nonlinear equations; Optimization methods; GPU; NUCDE solver; PQMRCGSTAB; accelerate; heterogenous;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Systems (ICPADS), 2009 15th International Conference on
Conference_Location :
Shenzhen
ISSN :
1521-9097
Print_ISBN :
978-1-4244-5788-5
Type :
conf
DOI :
10.1109/ICPADS.2009.76
Filename :
5395290