DocumentCode
1927719
Title
A mixed hardware-software approach to flexible Artificial Neural Network training on FPGA
Author
Aliaga, Ramón J. ; Gadea, Rafael ; Colom, Ricardo J. ; Cerdá, Joaquín ; Ferrando, Néstor ; Herrero, Vicente
Author_Institution
Inst. for the Implementation of Adv. Inf. & Commun. Technol. (ITACA), Univ. Politec. de Valencia, Valencia, Spain
fYear
2009
fDate
20-23 July 2009
Firstpage
1
Lastpage
8
Abstract
FPGAs offer a promising platform for the implementation of artificial neural networks (ANNs) and their training, combining the use of custom optimized hardware with low cost and fast development time. However, purely hardware realizations tend to focus on throughput, resorting to restrictions on applicable network topology or low-precision data representation, whereas flexible solutions allowing a wide variation of network parameters and training algorithms are usually restricted to software implementations. This paper proposes a mixed approach, introducing a system-on-chip (SoC) implementation where computations are carried out by a high efficiency neural coprocessor with a large number of parallel processing elements. System flexibility is provided by on-chip software control and the use of floating-point arithmetic, and network parallelism is exploited through replicated logic and application-specific coprocessor architecture, leading to fast training time. Performance results and design limitations and trade-offs are discussed.
Keywords
artificial intelligence; coprocessors; field programmable gate arrays; learning (artificial intelligence); system-on-chip; FPGA; application-specific coprocessor architecture; flexible artificial neural network training; floating-point arithmetic; low-precision data representation; mixed hardware-software approach; network parallelism; network topology; neural coprocessor; on-chip software control; parallel processing elements; system flexibility; system-on-chip; Artificial neural networks; Coprocessors; Cost function; Field programmable gate arrays; Network topology; Neural network hardware; Parallel processing; Software algorithms; System-on-a-chip; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Architectures, Modeling, and Simulation, 2009. SAMOS '09. International Symposium on
Conference_Location
Samos
Print_ISBN
978-1-4244-4502-8
Type
conf
DOI
10.1109/ICSAMOS.2009.5289235
Filename
5289235
Link To Document