DocumentCode
2457721
Title
OpenCL implementation of Cholesky matrix decomposition
Author
Brunelli, Claudio ; Aho, Eero ; Berg, Heikki
Author_Institution
Nokia Res. Center, Tampere, Finland
fYear
2011
fDate
Oct. 31 2011-Nov. 2 2011
Firstpage
62
Lastpage
67
Abstract
This paper presents some OpenCL implementations for Cholesky decomposition, a very popular algorithm used in linear algebra and signal processing applications. The Cholesky algorithm represents a very interesting candidate for OpenCL implementation since it contains sequential parts besides parallel ones. Furthermore, one step involves just a small amount of calculations. These characteristics pose challenges which call for suitable techniques to overcome the limitations of the language. We propose several versions of the implementation of the Cholesky algorithm, then provide an analysis of the trade off between complexity and performance offered by each of them. We also analyze the differences between execution of the program on GPU and on multicore CPU.
Keywords
computer graphic equipment; coprocessors; mathematics computing; matrix decomposition; multiprocessing systems; Cholesky matrix decomposition; GPU; OpenCL implementation; linear algebra; multicore CPU; signal processing applications; Algorithm design and analysis; Graphics processing unit; Kernel; Matrix decomposition; Multicore processing; Performance evaluation; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
System on Chip (SoC), 2011 International Symposium on
Conference_Location
Tampere
Print_ISBN
978-1-4577-0671-4
Electronic_ISBN
978-1-4577-0670-7
Type
conf
DOI
10.1109/ISSOC.2011.6089694
Filename
6089694
Link To Document