• DocumentCode
    110677
  • Title

    Action Recognition Using Nonnegative Action Component Representation and Sparse Basis Selection

  • Author

    Haoran Wang ; Chunfeng Yuan ; Weiming Hu ; Haibin Ling ; Wankou Yang ; Changyin Sun

  • Author_Institution
    Sch. of Autom., Southeast Univ., Nanjing, China
  • Volume
    23
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    570
  • Lastpage
    581
  • Abstract
    In this paper, we propose using high-level action units to represent human actions in videos and, based on such units, a novel sparse model is developed for human action recognition. There are three interconnected components in our approach. First, we propose a new context-aware spatial-temporal descriptor, named locally weighted word context, to improve the discriminability of the traditionally used local spatial-temporal descriptors. Second, from the statistics of the context-aware descriptors, we learn action units using the graph regularized nonnegative matrix factorization, which leads to a part-based representation and encodes the geometrical information. These units effectively bridge the semantic gap in action recognition. Third, we propose a sparse model based on a joint l2,1-norm to preserve the representative items and suppress noise in the action units. Intuitively, when learning the dictionary for action representation, the sparse model captures the fact that actions from the same class share similar units. The proposed approach is evaluated on several publicly available data sets. The experimental results and analysis clearly demonstrate the effectiveness of the proposed approach.
  • Keywords
    graph theory; image coding; image recognition; image representation; matrix decomposition; noise abatement; statistical analysis; context-aware spatial-temporal descriptor; geometrical information encoding; graph regularized nonnegative matrix factorization; human action recognition; human action representation; interconnected component; joint l2,1-norm sparse model; locally weighted word context; noise suppression; nonnegative action component representation; semantic gap; sparse basis selection; Context; Dictionaries; Feature extraction; Semantics; Vectors; Videos; Visualization; Action unit; action recognition; nonnegative matrix factorization; sparse representation;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2013.2292550
  • Filename
    6675065