Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Geneseo, NY, USA
Abstract :
This paper presents a novel unified run-time reconfigurable arithmetic processor design scheme. It provides novel computational trade-offs between array/matrix size and input data item bitwidth, and efficiently performs multiple types of arithmetic operations in pipeline within a single hardware-reusable processor. The proposed computations include inner product evaluation, matrix multiplication, and evaluation of polynomial. More specifically, we show that the minimum hardware platform can be easily reconfigured to complete: (1) the inner products of two input arrays with several combinations of array dimension and precision, including input arrays of 256 4-bit items, 64 8-bit items, 16 16-bit items, 4 32-bit items and I 64-bit item; (2) the product of matrices Xnk and Ykm for any integers n, k, m and any item precision b ranging from 4 to 64 bits, including input arrays of X16×16 Y16×16 of 4-bit items, X8×8 and Y8×8 of 8-bit items, X4×4, Y4×4 of 16-bit items, X2×2 and Y2×2 of 32-bit items and the product of two 64-bit numbers; (3) the polynomial evaluations at any given point x, with several combinations of the polynomial degree N and evaluation point number precision p, including polynomial degree and item precision options of N=64, p=13, N=16, p=16, and N=4, p=32
Keywords :
application specific integrated circuits; digital arithmetic; industrial property; matrix multiplication; microprocessor chips; multiplying circuits; pipeline arithmetic; reconfigurable architectures; 4 to 64 bit; arithmetic operations; array dimension; array size; computational trade-offs; evaluation point number precision; hardware-reusable processor; inner product evaluation; input data item bitwidth; item precision; matrix multiplication; polynomial degree; polynomial evaluations; unified reconfigurable arithmetic processor design; Central Processing Unit; Computer architecture; Computer science; Digital arithmetic; Hardware; Pipelines; Polynomials; Process design; Runtime; Very large scale integration;