• DocumentCode
    2457721
  • Title

    OpenCL implementation of Cholesky matrix decomposition

  • Author

    Brunelli, Claudio ; Aho, Eero ; Berg, Heikki

  • Author_Institution
    Nokia Res. Center, Tampere, Finland
  • fYear
    2011
  • fDate
    Oct. 31 2011-Nov. 2 2011
  • Firstpage
    62
  • Lastpage
    67
  • Abstract
    This paper presents some OpenCL implementations for Cholesky decomposition, a very popular algorithm used in linear algebra and signal processing applications. The Cholesky algorithm represents a very interesting candidate for OpenCL implementation since it contains sequential parts besides parallel ones. Furthermore, one step involves just a small amount of calculations. These characteristics pose challenges which call for suitable techniques to overcome the limitations of the language. We propose several versions of the implementation of the Cholesky algorithm, then provide an analysis of the trade off between complexity and performance offered by each of them. We also analyze the differences between execution of the program on GPU and on multicore CPU.
  • Keywords
    computer graphic equipment; coprocessors; mathematics computing; matrix decomposition; multiprocessing systems; Cholesky matrix decomposition; GPU; OpenCL implementation; linear algebra; multicore CPU; signal processing applications; Algorithm design and analysis; Graphics processing unit; Kernel; Matrix decomposition; Multicore processing; Performance evaluation; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System on Chip (SoC), 2011 International Symposium on
  • Conference_Location
    Tampere
  • Print_ISBN
    978-1-4577-0671-4
  • Electronic_ISBN
    978-1-4577-0670-7
  • Type

    conf

  • DOI
    10.1109/ISSOC.2011.6089694
  • Filename
    6089694