Title :
GPU-accelerated WZ factorization with the use of the CUBLAS library
Author :
Bylina, Beata ; Bylina, Jaroslaw
Author_Institution :
Inst. of Math., Marie Curie-Sklodowska Univ., Lublin, Poland
Abstract :
We present a novel implementation of a dense, square, non-structured matrix factorization algorithm, namely the WZ factorization - with the use of graphics processors (GPUs) and CPUs to gain a high performance at a low cost. We rewrite this factorization as operations on blocks of matrices and vectors. We have implemented our block-vector algorithm on GPUs with the use of an appropriate (and ready-to-use) GPU-accelerated mathematical library, namely the CUBLAS library. We compared the performance of our algorithm with CPU implementations. In particular, our implementation on an NVIDIA Tesla C2050 GPU outperforms a CPU-based implementation. Our results show that the algorithm scales well with the size of matrices; moreover, the larger the matrix, the better the performance. We also discuss the impact of the size of the matrix and the use of ready-to-use mathematical libraries on the numerical accuracy.
Keywords :
graphics processing units; libraries; mathematics computing; matrix decomposition; vectors; CPU; CUBLAS library; GPU-accelerated WZ factorization; GPU-accelerated mathematical library; NVIDIA Tesla C2050 GPU; block-vector algorithm; dense matrix factorization algorithm; graphics processors; nonstructured matrix factorization algorithm; numerical accuracy; square matrix factorization algorithm; Computer architecture; Graphics processing units; Libraries; Linear systems; Matrix decomposition; Software algorithms; Vectors; GPU; WZ factorization; matrix factorization; parallel computing;
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2012 Federated Conference on
Conference_Location :
Wroclaw
Print_ISBN :
978-1-4673-0708-6
Electronic_ISBN :
978-83-60810-51-4