Title :
Two level decomposition based matrix multiplication for FPGAs
Author :
Gao, Shuli ; Al-Khalili, Dhamin ; Chabini, Noureddine
Author_Institution :
Dept. of ECE, R. Mil. Coll. of Canada, Kingston, ON, Canada
Abstract :
In this paper, we present an efficient design approach for the implementation of large size matrix multiplication with wide bit size elements targeting FPGAs. The proposed technique first partitions the input matrices into smaller dimensions, and then segment the word-width of the elements into smaller sections. The segmentation is driven by the architecture of the targeted FPGA platform. A highly optimized scalar signed multiplier has been developed, and used as a basic block to construct a 2 by 2 matrix multiplier on Xilinx´ and Altera´s FPGAs. The result of the implementations showed that our method has outperformed the techniques utilized by commercial tools, ISE and Quartus, and the balanced word-width decomposition approach to realize the matrix multiplications proposed in. Compared to these approaches we achieved delay reduction range from 7.1% to 28% and area saving range from 6.7% to 32%.
Keywords :
field programmable gate arrays; matrix multiplication; Altera FPGA; FPGA; Xilinx FPGA; highly optimized scalar signed multiplier; input matrices; matrix multiplication; two level decomposition; wide bit size elements; wordwidth decomposition; Arithmetic; Delay; Design optimization; Educational institutions; Field programmable gate arrays; Matrix decomposition; Signal processing; Signal processing algorithms;
Conference_Titel :
Electronics, Circuits, and Systems, 2009. ICECS 2009. 16th IEEE International Conference on
Conference_Location :
Yasmine Hammamet
Print_ISBN :
978-1-4244-5090-9
Electronic_ISBN :
978-1-4244-5091-6
DOI :
10.1109/ICECS.2009.5410901