DocumentCode :
3043448
Title :
Sparse matrix transpose unit
Author :
Stathis, Pyrrhos ; Cheresiz, Dmitry ; Vassiliadis, Stamatis ; Juurlink, Ben
Author_Institution :
Dept. of Electr. Eng., Math. & Comput. Sci., Delft Univ. of Technol., Netherlands
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
90
Abstract :
Summary form only given. A large number of scientific applications involve the operation on, and manipulation of sparse matrices. Irregular structure of these matrices, however, causes hardware that otherwise behaves efficient on regular data to severely suffer in performance when handling sparse matrices. In order to tackle this problem, a scheme consisting of a novel hierarchical sparse matrix (HiSM) storage format and an associated architectural concept have been presented. We propose, describe, and evaluate a hardware mechanism to facilitate transposition of a sparse matrix stored in the HiSM format. The proposed hardware is meant to be embedded in a vector processor as a functional unit. The main part of the unit consists of an s × s word in-processor memory, where s is the vector processor´s section size. We determine suitable parameters for the proposed mechanism and study the performance of HiSM-based transposition using the matrices from the D-SAB benchmark suite. We show that the HiSM-based transposition executed on a vector processor equipped with the proposed unit exhibits speedups of up to 32.0 times with respect to the transposition based on the most widely used compressed row storage format and executed on a standard vector processor. When considering average speedup, depending on the properties of matrices being transposed, such as the size and the organization of nonzero elements, a speedup by a factor between 15.5 and 20 has been observed.
Keywords :
benchmark testing; performance evaluation; sparse matrices; storage management; vector processor systems; D-SAB benchmark suite; HiSM-based transposition; compressed row storage format; functional unit; hierarchical sparse matrix storage format; in-processor memory; vector processor; Application software; Computer architecture; Computer science; Costs; Hardware; Laboratories; Mathematics; Scientific computing; Sparse matrices; Vector processors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303033
Filename :
1303033
Link To Document :
بازگشت