• DocumentCode
    2315095
  • Title

    3D Human Action Recognition for Multi-view Camera Systems

  • Author

    Holte, Michael B. ; Moeslund, Thomas B. ; Nikolaidis, Nikos ; Pitas, Ioannis

  • Author_Institution
    Dept. of Archit., Design & Media Technol., Aalborg Univ., Aalborg, Denmark
  • fYear
    2011
  • fDate
    16-19 May 2011
  • Firstpage
    342
  • Lastpage
    349
  • Abstract
    This paper presents a novel approach for combining optical flow into enhanced 3D motion vector fields for human action recognition. Our approach detects motion of the actors by computing optical flow in video data captured by a multi-view camera setup with an arbitrary number of views. Optical flow is estimated in each view and extended to 3D using 3D reconstructions of the actors and pixel-to-vertex correspondences. The resulting 3D optical flow for each view is combined into a 3D motion vector field by taking the significance of local motion and its reliability into account. 3D Motion Context (3D-MC) and Harmonic Motion Context (HMC) are used to represent the extracted 3D motion vector fields efficiently and in a view-invariant manner, while considering difference in anthropometry of the actors and their movement style variations. The resulting 3D-MC and HMC descriptors are classified into a set of human actions using normalized correlation, taking into account the performing speed variations of different actors. We compare the performance of the 3D-MC and HMC descriptors, and show promising experimental results for the publicly available i3DPost Multi View Human Action Dataset.
  • Keywords
    cameras; feature extraction; image enhancement; image motion analysis; image sequences; object recognition; 3D human action recognition; 3D motion context; extracted 3D motion vector fields; harmonic motion context; i3DPost multiview human action dataset; multiview camera systems; optical flow; video data; Cameras; Context; Harmonic analysis; Humans; Optical imaging; Shape; Three dimensional displays; 3D motion description; 3D optical flow; human action recognition; multi-view;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), 2011 International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-61284-429-9
  • Electronic_ISBN
    978-0-7695-4369-7
  • Type

    conf

  • DOI
    10.1109/3DIMPVT.2011.50
  • Filename
    5955380