Title :
Throughput-Distortion Computation of Generic Matrix Multiplication: Toward a Computation Channel for Digital Signal Processing Systems
Author :
Anastasia, Davide ; Andreopoulos, Yiannis
Author_Institution :
Dept. of Electron. & Electr. Eng., Univ. Coll. London, London, UK
fDate :
4/1/2012 12:00:00 AM
Abstract :
The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra libraries used in many computationally demanding digital signal processing (DSP) systems. We propose an acceleration technique for GEMM based on dynamically adjusting the imprecision (distortion) of computation. Our technique employs adaptive scalar companding and rounding to input matrix blocks followed by two forms of packing in floating-point that allow for concurrent calculation of multiple results. Since the adaptive companding process controls the increase of concurrency (via packing), the increase in processing throughput (and the corresponding increase in distortion) depends on the input data statistics. To demonstrate this, we derive the optimal throughput-distortion control framework for GEMM for the broad class of zero-mean, independent identically distributed, input sources. Our approach converts matrix multiplication in programmable processors into a computation channel: when increasing the processing throughput, the output noise (error) increases due to: (i) coarser quantization; and (ii) computational errors caused by exceeding the machine-precision limitations. We show that, under certain distortion in the GEMM computation, the proposed framework can significantly surpass 100% of the peak performance of a given processor. The practical benefits of our proposal are shown in a face recognition system and a multilayer perceptron system trained for metadata learning from a large music feature database.
Keywords :
matrix multiplication; signal processing; computation channel; digital signal processing systems; face recognition system; generic matrix multiply function; high-performance linear algebra libraries; matrix multiplication; metadata learning; multilayer perceptron system; music feature database; throughput-distortion computation; Acceleration; Digital signal processing; Noise; Program processors; Proposals; Quantization; Throughput; BLAS level-3; high performance computing; matrix-based DSP; stochastic error estimation; throughput-distortion tradeoffs in DSP;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2011.2176337