• DocumentCode
    188137
  • Title

    Accelerator of Stacked Convolutional Independent Subspace Analysis for Deep Learning-Based Action Recognition

  • Author

    Lu He ; Yan Luo ; Yu Cao

  • Author_Institution
    Electr. & Comput. Eng., Univ. of Massachusetts, Lowell, MA, USA
  • fYear
    2014
  • fDate
    11-13 May 2014
  • Firstpage
    104
  • Lastpage
    104
  • Abstract
    Action recognition has been a research challenge in multimedia computing and machine vision. Recent advances in deep learning combined with stacked convolutional Independent Subspace Analysis (ISA) has achieved a better performance superior to all previously published results on several public available data sets. Unfortunately, one major issue in large-scale deployment of this new deep learning-based approach is the unacceptable latency of training with high-dimension data. In this paper, we propose a new hardware accelerator that can reduce the training time substantially for deep learning-based action recognition. Specifically, our proposed approach focuses on accelerating the convolutional stacked ISA algorithm, the core components of the deep learning-based action recognition algorithms. We design parallel pipelines, data parallelisms and look-up table to speed up the algorithm. With an embedded heterogeneous platform consisting of a general purpose processor and a FPGA, we are able to achieve up to 10X speedup for stacked ISA training compared to a software-only implementation.
  • Keywords
    computer vision; field programmable gate arrays; gesture recognition; learning (artificial intelligence); multimedia systems; FPGA; data parallelism; deep learning-based action recognition; embedded heterogeneous platform; general purpose processor; large-scale deployment; look-up table; machine vision; multimedia computing; parallel pipelines; stacked convolutional independent subspace analysis accelerator; Acceleration; Algorithm design and analysis; Computers; Educational institutions; Field programmable gate arrays; Pipeline processing; Training; Accelerator; Deep Learning; Independent Subspace Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines (FCCM), 2014 IEEE 22nd Annual International Symposium on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4799-5110-9
  • Type

    conf

  • DOI
    10.1109/FCCM.2014.37
  • Filename
    6861599