Title :
Scalable and modular algorithms for floating-point matrix multiplication on FPGAs
Author :
Zhuo, Ling ; Prasanna, Viktor K.
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Summary form only given. The abundant hardware resources on current FPGAs provide new opportunities to improve the performance of hardware implementations of scientific computations. We propose two FPGA-based algorithms for floating-point matrix multiplication, a fundamental kernel in a number of scientific applications. We analyze the design tradeoffs in implementing this kernel on FPGAs. Our algorithms employ a linear array architecture with a small control logic. This architecture effectively utilizes the hardware resources on the entire FPGA and reduces the routing complexity. The processing elements (PEs) used in our algorithms are modular so that floating-point units can be easily embedded into them. In our designs, the floating-point units are optimized to maximize the number of PEs integrated on the FPGA as well as the clock speed. Experimental results show that our algorithms achieve high clock speeds and provide good scalability. Our algorithms achieve superior sustained floating-point performance compared with existing FPGA-based implementations and state-of-the-art processors.
Keywords :
clocks; computational complexity; field programmable gate arrays; floating point arithmetic; logic design; matrix multiplication; optimisation; FPGA-based algorithm; array architecture; clock speed; control logic; floating-point matrix multiplication; hardware resources; kernel design; modular algorithm; processing element maximization; routing complexity; scientific computation; state-of-the-art processor; Algorithm design and analysis; Clocks; Computer architecture; Field programmable gate arrays; Floating-point arithmetic; Hardware; Kernel; Logic arrays; Logic devices; Routing;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1303036