DocumentCode :
252030
Title :
An efficient implementation of deep convolutional neural networks on a mobile coprocessor
Author :
Jonghoon Jin ; Gokhale, Vinayak ; Dundar, Aysegul ; Krishnamurthy, Bharadwaj ; Martini, Ben ; Culurciello, Eugenio
Author_Institution :
Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
fYear :
2014
fDate :
3-6 Aug. 2014
Firstpage :
133
Lastpage :
136
Abstract :
In this paper we present a hardware accelerated real-time implementation of deep convolutional neural networks (DCNNs). DCNNs are becoming popular because of advances in the processing capabilities of general purpose processors. However, DCNNs produce hundreds of intermediate results whose constant memory accesses result in inefficient use of general purpose processor hardware. By using an efficient routing strategy, we are able to maximize utilization of available hardware resources but also obtain high performance in real world applications. Our system, consisting of an ARM Cortex-A9 processor and a coprocessor, is capable of a peak performance of 40 G-ops/s while consuming less than 4W of power. The entire platform is in a small form factor which, combined with its high performance at low power consumption makes it feasible to use this hardware in applications like micro-UAVs, surveillance systems and autonomous robots.
Keywords :
autonomous aerial vehicles; coprocessors; low-power electronics; neural nets; ARM Cortex-A9 processor; autonomous robots; deep convolutional neural networks; hardware accelerated real-time implementation; low power consumption; memory accesses; microUAV; mobile coprocessor; surveillance systems; Mobile communication; Program processors; Robots; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (MWSCAS), 2014 IEEE 57th International Midwest Symposium on
Conference_Location :
College Station, TX
ISSN :
1548-3746
Print_ISBN :
978-1-4799-4134-6
Type :
conf
DOI :
10.1109/MWSCAS.2014.6908370
Filename :
6908370
Link To Document :
بازگشت