DocumentCode :
2102545
Title :
The multi 2D systolic design and implementation of Convolutional Neural Networks
Author :
Dawwd, Shefa A.
Author_Institution :
Comput. Eng. Dept., Univ. of Mosul, Mosul, Iraq
fYear :
2013
fDate :
8-11 Dec. 2013
Firstpage :
221
Lastpage :
224
Abstract :
A novel Systolic Convolutional Neural Network (SCoNN) architecture suitable to be implemented in a single hardware die is proposed in this paper. A large size of memory is required to store the input and intermediate data resulted among convolutional layers in the CoNN. However, to parallelize the computations, multiple memory accesses should be achieved simultaneously. Consequently, memory bandwidth should be increased by the use of either fast expensive components or complicated managed interleaved scheme. Our proposed system can address this challenge efficiently. This is achieved due to the use of smaller memory to store the input pattern and to re-use it as an intermediate memory for inter-processing patterns that produced between two convolutional layers. Using systolic technique which can efficiently exploit its input leads to achieve this goal. A high parallelism level can be achieved with a minimal number of memory accesses. Also, the proposed computational units is designed to be suitable for processing an image of scaled resolutions. The reason behind that is attributed to using a sliding window architecture to implement the convolution process. Now apart from the size of the input pattern or window and instead of feeding the window across the image, the image is fed through the window due to using FIFO caching scheme. According to these advantages, an easily, efficient, and scalable SCoNN architecture can be implemented using small FPGA resources.
Keywords :
convolution; image resolution; neural net architecture; parallel processing; FIFO caching scheme; FPGA resources; convolution process; convolutional layers; image processing; interprocessing patterns; memory accesses; memory bandwidth; multi2D systolic design; parallelism level; scalable SCoNN architecture; scaled resolutions; sliding window architecture; systolic convolutional neural network architecture; systolic technique; Arrays; Convolution; Feature extraction; Hardware; Random access memory; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Circuits, and Systems (ICECS), 2013 IEEE 20th International Conference on
Conference_Location :
Abu Dhabi
Type :
conf
DOI :
10.1109/ICECS.2013.6815394
Filename :
6815394
Link To Document :
بازگشت