Title :
Configurable Multimode Embedded Floating-Point Units for FPGAs
Author :
Chong, Yee Jern ; Parameswaran, Sri
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of New South Wales, Sydney, NSW, Australia
Abstract :
Performance of field-programmable gate arrays (FPGAs) used for floating-point applications is poor due to the complexity of floating-point arithmetic. Implementing floating-point units (FPUs) on FPGAs consume a large amount of resources. This makes FPGAs less attractive for use in floating-point intensive applications. Therefore, there is a need for embedded FPUs in FPGAs. However, if unutilized, embedded FPUs waste space on the FPGA die. To overcome this issue, we propose a flexible multimode embedded FPU for FPGAs that can be configured to perform a wide range of operations. The floating-point adder and multiplier in our embedded FPU can each be configured to perform one double-precision operation or two single-precision operations in parallel. To increase flexibility further, access to the large integer multiplier, adder and shifters in the FPU is provided. Benchmark circuits were implemented on both a standard Xilinx Virtex-II FPGA and on our FPGA with embedded FPU blocks. The results using our embedded FPUs showed a mean area improvement of 5.5 times and a mean delay improvement of 5.8 times for the double-precision benchmarks, and a mean area improvement of 3.8 times and a mean delay improvement of 4.2 times for the single-precision benchmarks. The embedded FPUs were also shown to provide significant area and delay benefits for fixed-point and integer circuits.
Keywords :
adders; field programmable gate arrays; floating point arithmetic; Xilinx Virtex-II FPGA; configurable multimode embedded floating-point unit; double-precision benchmark; double-precision operation; field-programmable gate array; fixed-point circuit; flexible multimode embedded FPU; floating-point adder; floating-point arithmetic; floating-point intensive application; floating-point multiplier; integer circuit; single-precision operation; Adders; Benchmark testing; Computer architecture; Delay; Field programmable gate arrays; Hardware; Routing; Dual-precision; FPGA architecture; embedded block; field-programmable gate array (FPGA); floating-point; floating-point unit (FPU);
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2010.2072996