Title :
Architecture Exploration of High-Performance Floating-Point Fused Multiply-Add Units and their Automatic Use in High-Level Synthesis
Author :
Liebig, Bjorn ; Huthmann, Jens ; Koch, Andreas
Abstract :
Multiply-add operations form a crucial part of many digital signal processing and control engineering applications. Since their performance is crucial for the application-level speed-up, it is worthwhile to explore a wide spectrum of implementations alternatives, trading increased area/energy usage to speed-up units on the critical path of the computation. This paper examines existing solutions and proposes two new architectures for floating-point fused multiply-adds, and also considers the impact of different in-fabric features of recent FPGA architectures. The units rely on different degrees of carry-save arithmetic improve performance by up to 2.5x over the closest state-of-the-art competitor. They are evaluated at the application level by modifying an existing high-level synthesis system to automatically insert the new units for computations on the critical path of three different convex solvers.
Keywords :
field programmable gate arrays; floating point arithmetic; high level synthesis; FPGA architectures; application-level speed-up; architecture exploration; convex solvers; high-level synthesis; high-performance floating-point fused multiply-add units; in-fabric features; multiply-add operations; state-of-the-art competitor; Acceleration; Accuracy; Adders; Computer architecture; Field programmable gate arrays; Multiplexing; Standards; FMA; FPGA; carry save; floating-point; fused; multiply-add;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
DOI :
10.1109/IPDPSW.2013.106