DocumentCode :
1689106
Title :
Evaluation and tuning of the Level 3 CUBLAS for graphics processors
Author :
Barrachina, Sergio ; Castillo, Maribel ; Igual, Francisco D. ; Mayo, Rafael ; Quintana-Ortí, Enrique S.
Author_Institution :
Depto. de Ing. y Cienc. de Comput., Univ. Jaume I, Castellon
fYear :
2008
Firstpage :
1
Lastpage :
8
Abstract :
The increase in performance of the last generations of graphics processors (GPUs) has made this class of platform a coprocessing tool with remarkable success in certain types of operations. In this paper we evaluate the performance of the Level 3 operations in CUBLAS, the implementation of BLAS for NVIDIA® GPUs with unified architecture. From this study, we gain insights on the quality of the kernels in the library and we propose several alternative implementations that are competitive with those in CUBLAS. Experimental results on a GeForce 8800 Ultra compare the performance of CUBLAS and the new variants.
Keywords :
coprocessors; linear algebra; mathematics computing; NVIDIA GPU; coprocessing tool; graphics processors; level 3 CUBLAS; unified architecture; Clocks; Computer architecture; Frequency; Graphics; Hardware; Heart; Kernel; Libraries; Linear algebra; Pipelines; BLAS; Graphics processors; high performance; linear algebra;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
ISSN :
1530-2075
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2008.4536485
Filename :
4536485