• DocumentCode
    3250307
  • Title

    NnSP: embedded neural networks stream processor

  • Author

    Esmaeilzadeh, Hadi ; Farzan, Farhang ; Shahidi, Neda ; Fakhraie, S.M. ; Lucas, Caro ; Tehranipoor, M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Tehran Univ.
  • fYear
    2005
  • fDate
    7-10 Aug. 2005
  • Firstpage
    223
  • Abstract
    Exploiting neural networks native parallelism and interaction locality, dedicated parallel hardware implementation of neural networks is essential for their effective use in time-critical applications. The architecture proposed in this paper is a parallel stream processor called neural networks stream processor or NnSP which can be programmed to realize different neural-network topologies and architectures. NnSP is a collection of programmable processing engines organized in custom FIFO-based cache architecture and busing system. Streams of synaptic data flow through the parallel processing elements, and computations are performed based on the instructions embedded in the preambles of the data streams. The command and configuration words embedded in the preamble of a stream, program each processing element to perform a desired computation on the upcoming data. The packetized nature of the stream architecture brings up a high degree of flexibility and scalability for NnSP. The stream processor is synthesized targeting an ASIC standard cell library for SoC implementation and also is realized on Xilinx VirtexII-Pro SoPC beds. A neural network employed for mobile robot navigation control, is implemented on the realized SoPC hardware. The realization-speedup achievements are presented here
  • Keywords
    neural nets; parallel processing; system-on-chip; ASIC standard cell library; FIFO-based cache architecture; NnSP; SoC implementation; Xilinx VirtexII-Pro SoPC beds; busing system; command words; configuration words; data streams; interaction locality; mobile robot navigation control; neural networks stream processor; parallel hardware implementation; parallel stream processor; programmable processing engines; synaptic data flow; time-critical applications; Computer aided instruction; Computer architecture; Concurrent computing; Embedded computing; Engines; Network topology; Neural network hardware; Neural networks; Parallel processing; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2005. 48th Midwest Symposium on
  • Conference_Location
    Covington, KY
  • Print_ISBN
    0-7803-9197-7
  • Type

    conf

  • DOI
    10.1109/MWSCAS.2005.1594079
  • Filename
    1594079