Title :
High-performance FPGA-based general reduction methods
Author :
Morris, Gerald R. ; Zhuo, Ling ; Prasanna, Viktor K.
Author_Institution :
Dept. of Electr. Eng., South Carolina Univ., Columbia, SC, USA
Abstract :
FPGA-based floating-point kernels must exploit algorithmic parallelism and use deeply pipelined cores to gain a performance advantage over general-purpose processors. Inability to hide the latency of lengthy pipelines can significantly reduce the performance or impose unrealistic buffer requirements. Designs requiring reduction operations such as accumulation are particularly susceptible. In this paper we introduce two high-performance FPGA-based methods for reducing multiple sets of sequentially delivered floating-point values in optimal time without stalling the pipeline.
Keywords :
field programmable gate arrays; floating point arithmetic; pipeline arithmetic; FPGA-based floating-point kernel; algorithmic parallelism; field programmable gate array; floating-point value reduction; pipelined core; Adders; Buffer overflow; Circuits; Crops; Delay; Design methodology; Kernel; Performance gain; Pipelines; Steady-state;
Conference_Titel :
Field-Programmable Custom Computing Machines, 2005. FCCM 2005. 13th Annual IEEE Symposium on
Print_ISBN :
0-7695-2445-1
DOI :
10.1109/FCCM.2005.42