• DocumentCode
    573270
  • Title

    Novel GPU implementation of Jacobi algorithm for Karhunen-Loève transform of dense matrices

  • Author

    Torun, Mustafa U. ; Yilmaz, Onur ; Akansu, Ali N.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. Heights, Newark, NJ, USA
  • fYear
    2012
  • fDate
    21-23 March 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Jacobi algorithm for Karhunen-Loève transform of a symmetric real matrix, and its parallel implementation using chess tournament algorithm are revisited in this paper. Impact of memory access patterns and significance of memory coalescing on the performance of the GPU implementation for the parallel Jacobi algorithm are emphasized. Two novel memory access methods for the Jacobi algorithm are proposed. It is shown with simulation results that one of the proposed methods achieves 77.3% computational performance improvement over the traditional GPU methods, and it runs 73.5 times faster than CPU for a dense symmetric square matrix of size 1,024.
  • Keywords
    Jacobian matrices; Karhunen-Loeve transforms; graphics processing units; parallel algorithms; parallel architectures; CUDA; GPU implementation; Karhunen-Loeve transform; chess tournament algorithm; dense symmetric square matrix; memory access pattern; memory coalescing; parallel Jacobi algorithm; symmetric real matrix; Graphics processing unit; Instruction sets; Karhunen-Loeve transforms; CUDA; Eigendecomposition; GPU Computing; Jacobi Algorithm; KLT;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Sciences and Systems (CISS), 2012 46th Annual Conference on
  • Conference_Location
    Princeton, NJ
  • Print_ISBN
    978-1-4673-3139-5
  • Electronic_ISBN
    978-1-4673-3138-8
  • Type

    conf

  • DOI
    10.1109/CISS.2012.6310720
  • Filename
    6310720