• DocumentCode
    735017
  • Title

    3D sparse quantization for feature learning in action recognition

  • Author

    Yang Zhao ; Hong Cheng ; Lu Yang

  • Author_Institution
    Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • fYear
    2015
  • fDate
    12-15 July 2015
  • Firstpage
    263
  • Lastpage
    267
  • Abstract
    Feature learning plays a crucial role in the successful human action recognition. There has been a number of approaches extracting action features from depth information and 3D skeletal data. However, either the skeleton information or the depth map is not accurate for feature learning unless complex descriptors are carefully designed and embedded. In this paper, we first propose a data sparsification technique to sparsify the cuboids in the depth video clip. Then, a novel formulation of the cuboid descriptor is proposed based on the 3D Sparse Quantization (3DSQ). Furthermore, we build a Spatial-Temporal Pyramid (STP) structure with max pooling to hierarchically represent the action sample in depth domain. We demonstrate our feature learning technique with action recognition tasks using the public MSRAction3D and MARDaily-Action3D datasets. Experimental results show that the proposed approach outperforms state-of-the-art feature learning approaches and significantly improves the action recognition accuracy.
  • Keywords
    feature extraction; gesture recognition; learning (artificial intelligence); video signal processing; 3D skeletal data; 3D sparse quantization; 3DSQ; MARDaily-Action3D dataset; STP structure; action recognition task; complex descriptor; cuboid descriptor; depth information; depth map; depth video clip; feature extraction; feature learning technique; human action recognition; max pooling; public MSRAction3D; skeleton information; spatial-temporal pyramid structure; Accuracy; Computer vision; Conferences; Feature extraction; Pattern recognition; Quantization (signal); Three-dimensional displays; 3D sparse quantization; data sparsification; human action recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2015.7230404
  • Filename
    7230404