DocumentCode :
1013485
Title :
Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning
Author :
Ali, Saad ; Shah, Mubarak
Author_Institution :
Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
32
Issue :
2
fYear :
2010
Firstpage :
288
Lastpage :
303
Abstract :
We propose a set of kinematic features that are derived from the optical flow for human action recognition in videos. The set of kinematic features includes divergence, vorticity, symmetric and antisymmetric flow fields, second and third principal invariants of flow gradient and rate of strain tensor, and third principal invariant of rate of rotation tensor. Each kinematic feature, when computed from the optical flow of a sequence of images, gives rise to a spatiotemporal pattern. It is then assumed that the representative dynamics of the optical flow are captured by these spatiotemporal patterns in the form of dominant kinematic trends or kinematic modes. These kinematic modes are computed by performing principal component analysis (PCA) on the spatiotemporal volumes of the kinematic features. For classification, we propose the use of multiple instance learning (MIL) in which each action video is represented by a bag of kinematic modes. Each video is then embedded into a kinematic-mode-based feature space and the coordinates of the video in that space are used for classification using the nearest neighbor algorithm. The qualitative and quantitative results are reported on the benchmark data sets.
Keywords :
image classification; image sequences; learning (artificial intelligence); principal component analysis; tensors; video signal processing; feature space; flow gradient; human action recognition; image sequence; kinematic features; kinematic mode; kinematic trend; multiple instance learning; nearest neighbor algorithm; optical flow; principal component analysis; rotation tensor; spatiotemporal pattern; strain tensor; video classification; Capacitive sensors; Humans; Image motion analysis; Kinematics; Nearest neighbor searches; Optical computing; Principal component analysis; Spatiotemporal phenomena; Tensile stress; Videos; Action recognition; Feature representation; Motion; Video analysis; kinematic features.; motion; principal component analysis; video analysis; Algorithms; Artificial Intelligence; Biomechanics; Humans; Movement; Pattern Recognition, Automated; Principal Component Analysis; Video Recording;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.284
Filename :
4693713
Link To Document :
بازگشت