DocumentCode
2316140
Title
Bit matrix multiplication in commodity processors
Author
Hilewitz, Yedidya ; Lauradoux, Cédric ; Lee, Ruby B.
Author_Institution
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ
fYear
2008
fDate
2-4 July 2008
Firstpage
7
Lastpage
12
Abstract
Registers in processors generally contain words or, with the addition of multimedia extensions, short vectors of subwords of bytes or 16-bit elements. In this paper, we view the contents of registers as vectors or matrices of individual bits. However, the facility to operate efficiently on the bit-level is generally lacking. A commodity processor usually only has logical and shift instructions and occasionally population count instructions. Perhaps the most powerful primitive bit-level operation is the bit matrix multiply (BMM) instruction, currently found only in supercomputers like Cray. This instruction multiplies two ntimesn bit matrices. In this paper, we show the power of BMM. We propose and analyze new processor instructions that implement simpler BMM primitive operations more suitable for a commodity processor. We show the impact of BMM on the performance of critical application kernels and discuss its hardware cost.
Keywords
Cray computers; matrix multiplication; microprocessor chips; shift registers; BMM; Cray supercomputer; bit matrix multiplication; commodity processors; logical instructions; processor instructions; shift instructions; Acceleration; Application software; Costs; Hardware; Kernel; Microprocessors; Parallel processing; Registers; Scattering; Supercomputers;
fLanguage
English
Publisher
ieee
Conference_Titel
Application-Specific Systems, Architectures and Processors, 2008. ASAP 2008. International Conference on
Conference_Location
Leuven
ISSN
2160-0511
Print_ISBN
978-1-4244-1897-8
Electronic_ISBN
2160-0511
Type
conf
DOI
10.1109/ASAP.2008.4580146
Filename
4580146
Link To Document