DocumentCode :
981618
Title :
ACE16k: the third generation of mixed-signal SIMD-CNN ACE chips toward VSoCs
Author :
Rodriguez-Vazquez, Angel ; Linan-Cembrano, Gustavo ; Carranza, L. ; Roca-Moreno, Elisenda ; Carmona-Galan, R. ; Jimenez-Garrido, F. ; Dominguez-Castro, R. ; Meana, Servando Espejo
Author_Institution :
Centro Nacional de Microelectron., Univ. de Sevilla, Seville, Spain
Volume :
51
Issue :
5
fYear :
2004
fDate :
5/1/2004 12:00:00 AM
Firstpage :
851
Lastpage :
863
Abstract :
Today, with 0.18-μm technologies mature and stable enough for mixed-signal design with a large variety of CMOS compatible optical sensors available and with 0.09-μm technologies knocking at the door of designers, we can face the design of integrated systems, instead of just integrated circuits. In fact, significant progress has been made in the last few years toward the realization of vision systems on chips (VSoCs). Such VSoCs are eventually targeted to integrate within a semiconductor substrate the functions of optical sensing, image processing in space and time, high-level processing, and the control of actuators. The consecutive generations of ACE chips define a roadmap toward flexible VSoCs. These chips consist of arrays of mixed-signal processing elements (PEs) which operate in accordance with single instruction multiple data (SIMD) computing architectures and exhibit the functional features of CNN Universal Machines. They have been conceived to cover the early stages of the visual processing path in a fully-parallel manner, and hence more efficiently than DSP-based systems. Across the different generations, different improvements and modifications have been made looking to converge with the newest discoveries of neurobiologists regarding the behavior of natural retinas. This paper presents considerations pertaining to the design of a member of the third generation of ACE chips, namely to the so-called ACE16k chip. This chip, designed in a 0.35-μm standard CMOS technology, contains about 3.75 million transistors and exhibits peak computing figures of 330 GOPS, 3.6 GOPS/mm2 and 82.5 GOPS/W. Each PE in the array contains a reconfigurable computing kernel capable of calculating linear convolutions on 3×3 neighborhoods in less than 1.5 μs, imagewise Boolean combinations in less than 200 ns, imagewise arithmetic operations in about 5 μs, and CNN-like temporal evolutions with a time constant of about 0.5 μs. Unfortunately, the many ideas underlying the design of this chip cannot be covered in a single paper; hence, this paper is focused on, first, placing the ACE16k in the ACE chip roadmap and, then, discussing the most significant modifications of ACE16K versus its predecessors in the family.
Keywords :
CMOS image sensors; computer vision; digital signal processing chips; image processing; parallel architectures; programmable circuits; ACE chips; Boolean combinations; CMOS technology; CNN Universal Machines; DSP-based systems; VLSI; actuators; computing architectures; image processing; imagewise arithmetic operations; instruction multiple data; integrated circuits; integrated systems design; mixed-signal design; mixed-signal processing elements; optical sensing; optical sensors; semiconductor substrate; silicon retinas; vision chips; vision systems on chips; visual processing path; CMOS integrated circuits; CMOS technology; Face detection; Integrated circuit technology; Machine vision; Optical design; Optical sensors; Photonic integrated circuits; Substrates; System-on-a-chip; Analog programmable very large-scale integration; VLSI; early vision chips; silicon retinas;
fLanguage :
English
Journal_Title :
Circuits and Systems I: Regular Papers, IEEE Transactions on
Publisher :
ieee
ISSN :
1549-8328
Type :
jour
DOI :
10.1109/TCSI.2004.827621
Filename :
1296799
Link To Document :
بازگشت