DocumentCode
2797910
Title
LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware
Author
Galoppo, Nico ; Govindaraju, Naga K. ; Henson, Michael ; Manocha, Dinesh
Author_Institution
University of North Carolina at Chapel Hill
fYear
2005
fDate
12-18 Nov. 2005
Firstpage
3
Lastpage
3
Abstract
We present a novel algorithm to solve dense linear systems using graphics processors (GPUs). We reduce matrix decomposition and row operations to a series of rasterization problems on the GPU. These include new techniques for streaming index pairs, swapping rows and columns and parallelizing the computation to utilize multiple vertex and fragment processors. We also use appropriate data representations to match the rasterization order and cache technology of graphics processors. We have implemented our algorithm on different GPUs and compared the performance with optimized CPU implementations. In particular, our implementation on a NVIDIA GeForce 7800 GPU outperforms a CPU-based ATLAS implementation. Moreover, our results show that our algorithm is cache and bandwidth efficient and scales well with the number of fragment processors within the GPU and the core GPU clock rate. We use our algorithm for fluid flow simulation and demonstrate that the commodity GPU is a useful co-processor for many scientific applications.
Keywords
Appropriate technology; Bandwidth; Clocks; Concurrent computing; Coprocessors; Fluid flow; Graphics; Hardware; Linear systems; Matrix decomposition;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference
Print_ISBN
1-59593-061-2
Type
conf
DOI
10.1109/SC.2005.42
Filename
1559955
Link To Document