DocumentCode :
3083586
Title :
Throughput-precision computation for generic matrix multiplication: Toward a computation channel for high-performance digital signal processing
Author :
Anastasia, Davide ; Andreopoulos, Yiannis
Author_Institution :
Dept. of Electron. & Electr. Eng., Univ. Coll. London, London, UK
fYear :
2011
fDate :
6-8 July 2011
Firstpage :
1
Lastpage :
6
Abstract :
The generic matrix multiply (GEMM) subprogram is the core element of high-performance linear algebra software used in computationally-demanding digital signal processing (DSP) systems. We propose an acceleration technique for GEMM based on dynamically adjusting the precision of computation. Our technique employs DSP methods (such as scalar companding and rounding), followed by a new form of tight packing in floating-point that allows for concurrent calculation of multiple results. Since the companding process controls the increase of concurrency (via packing), the increase in processing throughput (and the corresponding loss in precision) depends on the input data statistics: low-variance parts of the matrix multiplication are computed faster than high-variance parts and the error is controlled in a stochastic and not in a worst-case sense. This can convert high-performance numerical DSP libraries into a computation channel where the output error increases when higher throughput is requested. Potential DSP applications that can benefit from the proposed approach are highlighted.
Keywords :
digital libraries; digital signal processing chips; linear algebra; matrix multiplication; process control; stochastic systems; DSP method; GEMM; companding process control; computationally-demanding digital signal processing system; data statistics; floating point; generic matrix multiplication; generic matrix multiply subprogram; high performance numerical DSP library; high-performance digital signal processing; high-performance linear algebra software; stochastic control; throughput-precision computation channel; tight packing; Acceleration; Digital signal processing; Kernel; Program processors; Signal to noise ratio; Throughput; BLAS level-3; high performance computing; matrix-based digital signal processing; stochastic error estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing (DSP), 2011 17th International Conference on
Conference_Location :
Corfu
ISSN :
Pending
Print_ISBN :
978-1-4577-0273-0
Type :
conf
DOI :
10.1109/ICDSP.2011.6004973
Filename :
6004973
Link To Document :
بازگشت