DocumentCode :
3533903
Title :
A fast parallel matrix multiplication reconfigurable unit utilized in face recognitions systems
Author :
Sotiropoulos, I. ; Papaefstathiou, I.
Author_Institution :
I&C Sch., Swiss Fed. Inst. of Technol. (EPFL), Lausanne, Switzerland
fYear :
2009
fDate :
Aug. 31 2009-Sept. 2 2009
Firstpage :
276
Lastpage :
281
Abstract :
In this paper we present a reconfigurable device which significantly improves the execution time of the most computational intensive functions of three of the most widely used face recognition algorithms; those tasks multiply very large dense matrices. The presented architecture utilizes numerous digital signal processing units (DSPs) organized in a parallel manner within a state-of-the-art FPGA device. In order to accelerate those functions we have implemented a ldquoblockedrdquo matrix multiplication algorithm which multiplies certain sub-matrices of fixed-point 32-bit numbers; the size of the sub-matrices has been selected so as to fully exploit the resources of the underlying reconfigurable device. Our system is up to 550 times faster than a conventional general purpose processor when implementing the most CPU intensive parts of a number of very widely used face identification schemes, whereas it is more than 40 times faster than the similar schemes implemented in reconfigurable devices. Moreover, our system is general enough so as to be efficiently utilized in any application incorporating fixed-point matrix multiplications.
Keywords :
digital signal processing chips; face recognition; field programmable gate arrays; fixed point arithmetic; matrix multiplication; CPU intensive part; DSP; computational intensive function; digital signal processing unit; face identification scheme; face recognition system; fixed-point 32-bit number; fixed-point arithmetic; fixed-point matrix multiplication; parallel blocked matrix multiplication reconfigurable unit; state-of-the-art FPGA device; Acceleration; Algorithm design and analysis; Bayesian methods; Databases; Digital signal processing; Face recognition; Field programmable gate arrays; Principal component analysis; Signal processing algorithms; Statistical analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field Programmable Logic and Applications, 2009. FPL 2009. International Conference on
Conference_Location :
Prague
ISSN :
1946-1488
Print_ISBN :
978-1-4244-3892-1
Type :
conf
DOI :
10.1109/FPL.2009.5272287
Filename :
5272287
Link To Document :
بازگشت