• DocumentCode
    3501334
  • Title

    A reconfigurable accelerator for neuromorphic object recognition

  • Author

    Sabarad, Jagdish ; Kestur, Srinidhi ; Park, Mi Sun ; Dantara, Dharav ; Narayanan, Vijaykrishnan ; Chen, Yang ; Khosla, Deepak

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2012
  • fDate
    Jan. 30 2012-Feb. 2 2012
  • Firstpage
    813
  • Lastpage
    818
  • Abstract
    Advances in neuroscience have enabled researchers to develop computational models of auditory, visual and learning perceptions in the human brain. HMAX, which is a biologically inspired model of the visual cortex, has been shown to outperform standard computer vision approaches for multi-class object recognition. HMAX, while computationally demanding, can be potentially applied in various applications such as autonomous vehicle navigation, unmanned surveillance and robotics. In this paper, we present a reconfigurable hardware accelerator for the time-consuming S2 stage of the HMAX model. The accelerator leverages spatial parallelism, dedicated wide data buses with on-chip memories to provide an energy efficient solution to enable adoption into embedded systems. We present a systolic array-based architecture which includes a run-time reconfigurable convolution engine which can perform multiple variable-sized convolutions in parallel. An automation flow is described for this accelerator which can generate optimal hardware configurations for a given algorithmic specification and also perform run-time configuration and execution seamlessly. Experimental results on Virtex-6 FPGA platforms show 5X to 11X speedups and 14X to 33X higher performance-per-Watt over a CNS-based implementation on a Tesla GPU.
  • Keywords
    biocomputing; hearing; object recognition; CNS-based implementation; HMAX; Tesla GPU; Virtex-6 FPGA platform; algorithmic specification; auditory perception; automation flow; autonomous vehicle navigation; computational model; embedded system; human brain; learning perception; multiclass object recognition; neuromorphic object recognition; neuroscience; on-chip memories; optimal hardware configuration; reconfigurable accelerator; reconfigurable hardware accelerator; robotics; run-time configuration; run-time reconfigurable convolution engine; spatial parallelism; systolic array-based architecture; unmanned surveillance; variable-sized convolution; visual cortex; visual perception; wide data buses; Computational modeling; Convolution; Field programmable gate arrays; Graphics processing unit; Hardware; Kernel; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design Automation Conference (ASP-DAC), 2012 17th Asia and South Pacific
  • Conference_Location
    Sydney, NSW
  • ISSN
    2153-6961
  • Print_ISBN
    978-1-4673-0770-3
  • Type

    conf

  • DOI
    10.1109/ASPDAC.2012.6165067
  • Filename
    6165067