DocumentCode
779685
Title
Efficient technique for partitioning and programming linear algebra algorithms on concurrent VLSI architectures
Author
Di Zitti, E. ; Chirico, M. ; Curatelli, F. ; Bisio, G.M.
Author_Institution
Dept. of Biophys. & Electron. Eng., Genoa Univ., Italy
Volume
142
Issue
2
fYear
1995
fDate
4/1/1995 12:00:00 AM
Firstpage
97
Lastpage
104
Abstract
An efficient technique for partitioning and programming linear algebra algorithms on concurrent architectures is described and applied to 2-D wavefront arrays. The mapping of the computational elements (processes) to processors is based on the concept of folding. The mapping pattern on the 2-D full-size mesh of processes is composition of symmetric tiles of size 2√(N)×2√(N), N being the number of processors. The algorithm can be partitioned according to a globally sequential, locally parallel scheme. The code optimisation is performed by programming a few different types of tile, according to the algorithm. When the size of the problem is much larger than the size of the mesh of processors, a linear speed-up is achieved independently of the number of processors. Experimental results are presented for matrix multiplication, LU decomposition and the solution of triangular system equations on 2-D meshes of transputers programmed in Occam
Keywords
Occam; digital signal processing chips; linear algebra; multiprocessing systems; parallel algorithms; parallel architectures; transputer systems; 2D meshes; 2D wavefront arrays; LU decomposition; Occam; code optimisation; computational elements; concurrent VLSI architectures; folding; full-size mesh; linear algebra algorithms; linear speed-up; locally parallel scheme; mapping pattern; matrix multiplication; partitioning; symmetric tiles; transputers; triangular system equations;
fLanguage
English
Journal_Title
Circuits, Devices and Systems, IEE Proceedings -
Publisher
iet
ISSN
1350-2409
Type
jour
DOI
10.1049/ip-cds:19951795
Filename
384704
Link To Document