DocumentCode :
2139990
Title :
Evaluating the Performance of Basic Linear Algebra Subroutines on a Torus Array Processor
Author :
Zekri, Ahmed S. ; Sedukhin, Stanislav G.
Author_Institution :
Univ. of Aizu, Aizuwakamatsu
fYear :
2007
fDate :
16-19 Oct. 2007
Firstpage :
300
Lastpage :
305
Abstract :
The basic linear algebra subroutines (BLAS) are standard operations to efficiently solve the linear algebra problems on high performance and parallel systems. In this paper, we study the implementation of some important BLAS operations on a NtimesN torus array processor. We show that the performance of the Level-3 BLAS represented by the nxn matrix multiply-add operation, n>N, approaches the theoretical peak as n increases since the degree of data reusing is high. While the performance of Level-1 and Level-2 BLAS operations is low as a result of low data reusing. Fortunately, many applications are based on intensive use of Level-3 BLAS with small percentage of Level-1 and Level-2 BLAS.
Keywords :
linear algebra; parallel processing; BIAS; basic linear algebra subroutines; torus array processor; Algorithms; Application software; Concurrent computing; Coprocessors; Costs; Graphics; High performance computing; Linear algebra; Registers; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
Conference_Location :
Aizu-Wakamatsu, Fukushima
Print_ISBN :
978-0-7695-2983-7
Type :
conf
DOI :
10.1109/CIT.2007.166
Filename :
4385098
Link To Document :
بازگشت