Title :
On synthesizing optimal family of linear systolic arrays for matrix multiplication
Author :
Kumar, V. K Prasanna ; Tsai, Yu-Chen
Author_Institution :
Dept. of Electr. Eng.-Syst., Univ. of Southern California, Los Angeles, CA, USA
fDate :
6/1/1991 12:00:00 AM
Abstract :
The authors describe a family of linear systolic arrays for matrix multiplication exhibiting a tradeoff between local storage and the number of processing elements (PEs). The design consists of processors hooked into a linear array with each processor having storage s, 1⩽s⩽n, for n×n matrix multiplication, where the number of processors equals n times the least integer ⩾n/s. The input matrices are fed as two speed data streams using fast and slow channels to satisfy the dependencies in the usual matrix multiplication algorithm. While a family of linear arrays have been synthesized for this problem, this technique leads to simpler designs with fewer number of processors and improved delay from input to output. All these designs use the optimal number of processors for local storage in the range 1⩽s⩽n. The data flow is unidirectional, which makes the designs implementable on fault wafer scale integration models
Keywords :
VLSI; circuit layout CAD; systolic arrays; delay; fault wafer scale integration models; local storage; matrix multiplication; optimal family of linear systolic arrays; processing elements; Costs; Data structures; Dynamic programming; Hypercubes; Load management; Parallel algorithms; Parallel processing; Semiconductor device modeling; Systolic arrays; Very large scale integration;
Journal_Title :
Computers, IEEE Transactions on