• DocumentCode
    2262986
  • Title

    Iterative SLE Solvers over a CPU-GPU Platform

  • Author

    Binotto, Alécio P D ; Daniel, Christian ; Weber, Daniel ; Kuijper, Arjan ; Stork, André ; Pereira, Carlos ; Fellner, Dieter

  • Author_Institution
    Tech. Universitdt Darmstadt, Darmstadt, Germany
  • fYear
    2010
  • fDate
    1-3 Sept. 2010
  • Firstpage
    305
  • Lastpage
    313
  • Abstract
    GPUs (Graphics Processing Units) have become one of the main co-processors that contributed to desktops towards high performance computing. Together with multi-core CPUs, a powerful heterogeneous execution platform is built for massive calculations. To improve application performance and explore this heterogeneity, a distribution of workload in a balanced way over the PUs (Processing Units) plays an important role for the system. However, this problem faces challenges since the cost of a task at a PU is non-deterministic and can be influenced by several parameters not known a priori, like the problem size domain. We present a comparison of iterative SLE (Systems of Linear Equations) solvers, used in many scientific and engineering applications, over a heterogeneous CPU-GPUs platform and characterize scenarios where the solvers obtain better performances. A new technique to improve memory access on matrix-vector multiplication used by SLEs on GPUs is described and compared to standard implementations for CPU and GPUs. Such timing profiling is analyzed and break-even points based on the problem sizes are identified for this implementation, pointing whether our technique is faster to use GPU instead of CPU. Preliminary results show the importance of this study applied to a real-time CFD (Computational Fluid Dynamics) application with geometry modification.
  • Keywords
    computational fluid dynamics; coprocessors; geometry; iterative methods; mathematics computing; matrix multiplication; multiprocessing systems; CPU-GPU platform; computational fluid dynamics; coprocessors; geometry modification; graphics processing units; high performance computing; iterative SLE solvers; matrix vector multiplication; memory access; multicore CPU; Graphics processors; parallel processing; real-time CFD; solvers for SLEs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications (HPCC), 2010 12th IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • Print_ISBN
    978-1-4244-8335-8
  • Electronic_ISBN
    978-0-7695-4214-0
  • Type

    conf

  • DOI
    10.1109/HPCC.2010.40
  • Filename
    5581464