• DocumentCode
    2265699
  • Title

    An FPGA-based stream processor for embedded real-time vision with Convolutional Networks

  • Author

    Farabet, Clément ; Poulet, Cyril ; LeCun, Yann

  • Author_Institution
    Courant Inst. of Math. Sci., New York Univ., New York, NY, USA
  • fYear
    2009
  • fDate
    Sept. 27 2009-Oct. 4 2009
  • Firstpage
    878
  • Lastpage
    885
  • Abstract
    Many recent visual recognition systems can be seen as being composed of multiple layers of convolutional filter banks, interspersed with various types of non-linearities. This includes Convolutional Networks, HMAX-type architectures, as well as systems based on dense SIFT features or Histogram of Gradients. This paper describes a highly-compact and low power embedded system that can run such vision systems at very high speed. A custom board built around a Xilinx Virtex-4 FPGA was built and tested. It measures 70 × 80 mm, and the complete system-FPGA, camera, memory chips, flash-consumes 15 watts in peak, and is capable of more than 4 × 109 multiply-accumulate operations per second in real vision application. This enables real-time implementations of object detection, object recognition, and vision-based navigation algorithms in small-size robots, micro-UAVs, and hand-held devices. Real-time face detection is demonstrated, with speeds of 10 frames per second at VGA resolution.
  • Keywords
    channel bank filters; computer vision; convolution; field programmable gate arrays; navigation; object detection; object recognition; FPGA-based stream processor; HMAX-type architectures; Xilinx Virtex-4 FPGA; convolutional filter banks; convolutional networks; embedded real-time vision; hand-held devices; histogram of gradients; microUAV; object detection; object recognition; vision-based navigation algorithms; visual recognition systems; Cameras; Embedded system; Field programmable gate arrays; Filter bank; Histograms; Machine vision; Object detection; Robot vision systems; Semiconductor device measurement; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-4442-7
  • Electronic_ISBN
    978-1-4244-4441-0
  • Type

    conf

  • DOI
    10.1109/ICCVW.2009.5457611
  • Filename
    5457611