• DocumentCode
    3244597
  • Title

    Solving 2D Nonlinear Unsteady Convection-Diffusion Equations on Heterogenous Platforms with Multiple GPUs

  • Author

    Yang, Canqun ; Ge, Zhen ; Chen, Juan ; Wang, Feng ; Du, Yunfei

  • Author_Institution
    Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2009
  • fDate
    8-11 Dec. 2009
  • Firstpage
    961
  • Lastpage
    966
  • Abstract
    Solving complex convection-diffusion equations is very important to many practical mathematical and physical problems. After the finite difference discretization, most of the time for equations solution is spent on sparse linear equation solvers. In this paper, our goal is to solve 2D Nonlinear Unsteady Convection-Diffusion Equations by accelerating an iterative algorithm named Jacobi-preconditioned QMRCGSTAB on a heterogenous platform, which is composed of a multi-core processor and multiple GPUs. Firstly, a basic implementation and evaluation for adapting the problem to this kind of platform is given. Then, we propose two optimization methods to improve the performance: kernel merging method and matrix boundary data processing. Our experimental evaluation on an AMD Opteron(tm) quad-core processor 2380 linked to an NVIDIA Tesla S1070 platform with four GPUs delivers the peak performance of 33 GFLOPS (double precision), which is a speedup of close to a factor 32 compared to the same problem running on 4 cores of the same CPU.
  • Keywords
    computer graphics; coprocessors; finite difference methods; iterative methods; mathematical analysis; 2D nonlinear unsteady convection-diffusion equations; Jacobi preconditioned QMRCGSTAB; accelerating iterative algorithm; finite difference discretization; kernel merging method; matrix boundary data processing; multicore processor; multiple GPU; quad core processor; sparse linear equation solvers; Acceleration; Difference equations; Finite difference methods; Iterative algorithms; Jacobian matrices; Kernel; Merging; Multicore processing; Nonlinear equations; Optimization methods; GPU; NUCDE solver; PQMRCGSTAB; accelerate; heterogenous;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2009 15th International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4244-5788-5
  • Type

    conf

  • DOI
    10.1109/ICPADS.2009.76
  • Filename
    5395290