• DocumentCode
    1660589
  • Title

    Tuning matrix-vector multiplication on GPU

  • Author

    Dziekonski, Adam ; Mrozowski, Michal

  • Author_Institution
    Dept. of Microwave & Antenna Eng., Gdansk Univ. of Technol., Gdansk, Poland
  • fYear
    2010
  • Firstpage
    167
  • Lastpage
    170
  • Abstract
    A matrix times vector multiplication (matvec) is a cornerstone operation in iterative methods of solving large sparse systems of equations such as the conjugate gradients method (cg), the minimal residual method (minres), the generalized residual method (gmres) and exerts an influence on overall performance of those methods. An implementation of matvec is particularly demanding when one executes computations on a GPU (Graphics Processing Unit), because using this device one has to comply with certain programming rules in order to take advantage of parallel computing. In this paper, it will be shown how to modify the sparse matrix-vector multiplication based on CRS (Compressed Row Storage) to achieve about 3-5 times better performance on - a low cost - GPU (GeForce GTX 285, 1.48 GHz) than on a CPU (Intel Core i7, 2.67 GHz).
  • Keywords
    computer graphic equipment; coprocessors; iterative methods; matrix multiplication; sparse matrices; vectors; CRS; GPU; compressed row storage; conjugate gradients method; cornerstone operation; generalized residual method; graphics processing unit; iterative methods; large sparse systems; matrix-vector multiplication; minimal residual method; parallel computing; programming rules; Acceleration; Computer architecture; Graphics; Graphics processing unit; Instruction sets; Kernel; CUDA; GPU; Sparse Matrix times Vector multiplication (SpMV);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology (ICIT), 2010 2nd International Conference on
  • Conference_Location
    Gdansk
  • Print_ISBN
    978-1-4244-8182-8
  • Type

    conf

  • Filename
    5553364