Title :
Bit matrix multiplication in commodity processors
Author :
Hilewitz, Yedidya ; Lauradoux, Cédric ; Lee, Ruby B.
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ
Abstract :
Registers in processors generally contain words or, with the addition of multimedia extensions, short vectors of subwords of bytes or 16-bit elements. In this paper, we view the contents of registers as vectors or matrices of individual bits. However, the facility to operate efficiently on the bit-level is generally lacking. A commodity processor usually only has logical and shift instructions and occasionally population count instructions. Perhaps the most powerful primitive bit-level operation is the bit matrix multiply (BMM) instruction, currently found only in supercomputers like Cray. This instruction multiplies two ntimesn bit matrices. In this paper, we show the power of BMM. We propose and analyze new processor instructions that implement simpler BMM primitive operations more suitable for a commodity processor. We show the impact of BMM on the performance of critical application kernels and discuss its hardware cost.
Keywords :
Cray computers; matrix multiplication; microprocessor chips; shift registers; BMM; Cray supercomputer; bit matrix multiplication; commodity processors; logical instructions; processor instructions; shift instructions; Acceleration; Application software; Costs; Hardware; Kernel; Microprocessors; Parallel processing; Registers; Scattering; Supercomputers;
Conference_Titel :
Application-Specific Systems, Architectures and Processors, 2008. ASAP 2008. International Conference on
Conference_Location :
Leuven
Print_ISBN :
978-1-4244-1897-8
Electronic_ISBN :
2160-0511
DOI :
10.1109/ASAP.2008.4580146