Title :
Throughput-precision computation for generic matrix multiplication: Toward a computation channel for high-performance digital signal processing
Author :
Anastasia, Davide ; Andreopoulos, Yiannis
Author_Institution :
Dept. of Electron. & Electr. Eng., Univ. Coll. London, London, UK
Abstract :
The generic matrix multiply (GEMM) subprogram is the core element of high-performance linear algebra software used in computationally-demanding digital signal processing (DSP) systems. We propose an acceleration technique for GEMM based on dynamically adjusting the precision of computation. Our technique employs DSP methods (such as scalar companding and rounding), followed by a new form of tight packing in floating-point that allows for concurrent calculation of multiple results. Since the companding process controls the increase of concurrency (via packing), the increase in processing throughput (and the corresponding loss in precision) depends on the input data statistics: low-variance parts of the matrix multiplication are computed faster than high-variance parts and the error is controlled in a stochastic and not in a worst-case sense. This can convert high-performance numerical DSP libraries into a computation channel where the output error increases when higher throughput is requested. Potential DSP applications that can benefit from the proposed approach are highlighted.
Keywords :
digital libraries; digital signal processing chips; linear algebra; matrix multiplication; process control; stochastic systems; DSP method; GEMM; companding process control; computationally-demanding digital signal processing system; data statistics; floating point; generic matrix multiplication; generic matrix multiply subprogram; high performance numerical DSP library; high-performance digital signal processing; high-performance linear algebra software; stochastic control; throughput-precision computation channel; tight packing; Acceleration; Digital signal processing; Kernel; Program processors; Signal to noise ratio; Throughput; BLAS level-3; high performance computing; matrix-based digital signal processing; stochastic error estimation;
Conference_Titel :
Digital Signal Processing (DSP), 2011 17th International Conference on
Conference_Location :
Corfu
Print_ISBN :
978-1-4577-0273-0
DOI :
10.1109/ICDSP.2011.6004973