• DocumentCode
    2102545
  • Title

    The multi 2D systolic design and implementation of Convolutional Neural Networks

  • Author

    Dawwd, Shefa A.

  • Author_Institution
    Comput. Eng. Dept., Univ. of Mosul, Mosul, Iraq
  • fYear
    2013
  • fDate
    8-11 Dec. 2013
  • Firstpage
    221
  • Lastpage
    224
  • Abstract
    A novel Systolic Convolutional Neural Network (SCoNN) architecture suitable to be implemented in a single hardware die is proposed in this paper. A large size of memory is required to store the input and intermediate data resulted among convolutional layers in the CoNN. However, to parallelize the computations, multiple memory accesses should be achieved simultaneously. Consequently, memory bandwidth should be increased by the use of either fast expensive components or complicated managed interleaved scheme. Our proposed system can address this challenge efficiently. This is achieved due to the use of smaller memory to store the input pattern and to re-use it as an intermediate memory for inter-processing patterns that produced between two convolutional layers. Using systolic technique which can efficiently exploit its input leads to achieve this goal. A high parallelism level can be achieved with a minimal number of memory accesses. Also, the proposed computational units is designed to be suitable for processing an image of scaled resolutions. The reason behind that is attributed to using a sliding window architecture to implement the convolution process. Now apart from the size of the input pattern or window and instead of feeding the window across the image, the image is fed through the window due to using FIFO caching scheme. According to these advantages, an easily, efficient, and scalable SCoNN architecture can be implemented using small FPGA resources.
  • Keywords
    convolution; image resolution; neural net architecture; parallel processing; FIFO caching scheme; FPGA resources; convolution process; convolutional layers; image processing; interprocessing patterns; memory accesses; memory bandwidth; multi2D systolic design; parallelism level; scalable SCoNN architecture; scaled resolutions; sliding window architecture; systolic convolutional neural network architecture; systolic technique; Arrays; Convolution; Feature extraction; Hardware; Random access memory; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Circuits, and Systems (ICECS), 2013 IEEE 20th International Conference on
  • Conference_Location
    Abu Dhabi
  • Type

    conf

  • DOI
    10.1109/ICECS.2013.6815394
  • Filename
    6815394