DocumentCode :
3535227
Title :
A multi-FPGA architecture for stochastic Restricted Boltzmann Machines
Author :
Ly, Daniel L. ; Chow, Paul
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
fYear :
2009
fDate :
Aug. 31 2009-Sept. 2 2009
Firstpage :
168
Lastpage :
173
Abstract :
Although there are many neural network FPGA architectures, there is no framework for designing large, high-performance neural networks suitable for the real world. In this paper, we present two concepts to support a multi-FPGA architecture for stochastic restricted Boltzmann machines (RBM), a popular type of neural network. First, a hardware core, called the kth stage piecewise linear interpolator, is used to implement a high-precision, pipelined function generator. The interpolator increases the resolution of a look up table implementation, guaranteeing an additional bit of precision for every pipeline stage. This function generator is used to implement a sigmoid function required in stochastic node selection. Next, a partitioning algorithm is used to efficiently divide a RBM amongst multiple FPGAs. The partitioning algorithm optimizes performance by minimizing the inter-FPGA communication. The architecture is tested on the Berkeley Emulation Engine 2 running at 100 MHz. One board supports a RBM of 256 times 256 nodes, and results in a computational speed of 1.85 billion connection-updatesper- second and a speed-up of 85-fold over an optimized C program running on a 2.8 GHz Intel processor.
Keywords :
Boltzmann machines; field programmable gate arrays; function generators; pipeline processing; stochastic processes; table lookup; Berkeley Emulation Engine 2; Intel processor; frequency 100 MHz; frequency 2.8 GHz; kth stage piecewise linear interpolator; look up table; neural network FPGA architectures; optimized C program; partitioning algorithm; pipelined function generator; sigmoid function; stochastic node selection; stochastic restricted Boltzmann machines; Computer architecture; Field programmable gate arrays; Hardware; Neural networks; Partitioning algorithms; Piecewise linear techniques; Pipelines; Signal generators; Stochastic processes; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field Programmable Logic and Applications, 2009. FPL 2009. International Conference on
Conference_Location :
Prague
ISSN :
1946-1488
Print_ISBN :
978-1-4244-3892-1
Electronic_ISBN :
1946-1488
Type :
conf
DOI :
10.1109/FPL.2009.5272516
Filename :
5272516
Link To Document :
بازگشت