DocumentCode :
3250307
Title :
NnSP: embedded neural networks stream processor
Author :
Esmaeilzadeh, Hadi ; Farzan, Farhang ; Shahidi, Neda ; Fakhraie, S.M. ; Lucas, Caro ; Tehranipoor, M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Tehran Univ.
fYear :
2005
fDate :
7-10 Aug. 2005
Firstpage :
223
Abstract :
Exploiting neural networks native parallelism and interaction locality, dedicated parallel hardware implementation of neural networks is essential for their effective use in time-critical applications. The architecture proposed in this paper is a parallel stream processor called neural networks stream processor or NnSP which can be programmed to realize different neural-network topologies and architectures. NnSP is a collection of programmable processing engines organized in custom FIFO-based cache architecture and busing system. Streams of synaptic data flow through the parallel processing elements, and computations are performed based on the instructions embedded in the preambles of the data streams. The command and configuration words embedded in the preamble of a stream, program each processing element to perform a desired computation on the upcoming data. The packetized nature of the stream architecture brings up a high degree of flexibility and scalability for NnSP. The stream processor is synthesized targeting an ASIC standard cell library for SoC implementation and also is realized on Xilinx VirtexII-Pro SoPC beds. A neural network employed for mobile robot navigation control, is implemented on the realized SoPC hardware. The realization-speedup achievements are presented here
Keywords :
neural nets; parallel processing; system-on-chip; ASIC standard cell library; FIFO-based cache architecture; NnSP; SoC implementation; Xilinx VirtexII-Pro SoPC beds; busing system; command words; configuration words; data streams; interaction locality; mobile robot navigation control; neural networks stream processor; parallel hardware implementation; parallel stream processor; programmable processing engines; synaptic data flow; time-critical applications; Computer aided instruction; Computer architecture; Concurrent computing; Embedded computing; Engines; Network topology; Neural network hardware; Neural networks; Parallel processing; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2005. 48th Midwest Symposium on
Conference_Location :
Covington, KY
Print_ISBN :
0-7803-9197-7
Type :
conf
DOI :
10.1109/MWSCAS.2005.1594079
Filename :
1594079
Link To Document :
بازگشت