Title :
FPGA-based implementation of M4RM for matrix multiplication over GF(2)
Author :
Kumar, Vipin ; Kumar, Vinay B. Y. ; Patkar, Sachin B.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
Abstract :
The Method of Four Russians for Multiplication (M4RM) is one of the most efficient algorithms for dense matrix multiplication over binary field targeting particularly the commodity general purpose processors. We present an efficient tile-based hardware/software implementation of M4RM, with the hardware side handling the constituent block multiplications in a streaming fashion, and the software side doing the accumulations. With designs for 64 × 64 and 128 × 128 sized block matrix multiplications, sizes feasible for targeting FPGAs, we compare the performance with the fastest software implementations of M4RM on commodity processors. The designs were implemented in Bluespec SystemVerilog, and evaluated over the hardware/software co-emulation framework, SCE-MI. Using the 128 × 128 hardware modules, a 16, 384 × 16, 384 matrix multiplication, running at 140 MHz could be done in ~ 3.0s using the Strassen-Winograd scheme when targeting a Cyclone IV FPGA and at a sustained bit operations per cycle of ~ 8000; where, in comparision, M4RM on Intel Core2Duo running at 2.33GHz, takes ~ 8s and at a sustained bit operations per cycle of ~ 500.
Keywords :
Galois fields; field programmable gate arrays; hardware-software codesign; logic design; matrix multiplication; Bluespec SystemVerilog; Cyclone IV FPGA; GF(2); Intel Core2Duo; M4RM; SCE-MI; Strassen-Winograd scheme; binary field; block matrix multiplications; block multiplications; commodity general purpose processors; dense matrix multiplication; frequency 140 MHz; frequency 2.33 GHz; hardware modules; hardware-software co-emulation framework; method of four Russians for multiplication; tile-based hardware-software; Algorithm design and analysis; Cyclones; Field programmable gate arrays; Hardware; Ports (Computers); Program processors;
Conference_Titel :
VLSI Design and Test, 18th International Symposium on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4799-5088-1
DOI :
10.1109/ISVDAT.2014.6881072