Title :
Cross-View Action Modeling, Learning, and Recognition
Author :
Jiang Wang ; Xiaohan Nie ; Yin Xia ; Ying Wu ; Song-Chun Zhu
Abstract :
Existing methods on video-based action recognition are generally view-dependent, i.e., performing recognition from the same views seen in the training data. We present a novel multiview spatio-temporal and-or graph (MST-AOG) representation for cross-view action recognition, i.e., the recognition is performed on the video from an unknown and unseen view. As a compositional model, MST-AOG compactly represents the hierarchical combinatorial structures of cross-view actions by explicitly modeling the geometry, appearance and motion variations. This paper proposes effective methods to learn the structure and parameters of MST-AOG. The inference based on MST-AOG enables action recognition from novel views. The training of MST-AOG takes advantage of the 3D human skeleton data obtained from Kinect cameras to avoid annotating enormous multi-view video frames, which is error-prone and time-consuming, but the recognition does not need 3D information and is based on 2D video input. A new Multiview Action3D dataset has been created and will be released. Extensive experiments have demonstrated that this new action representation significantly improves the accuracy and robustness for cross-view action recognition on 2D videos.
Keywords :
cameras; geometry; graph theory; image motion analysis; image recognition; image representation; video signal processing; 2D videos; 3D human skeleton data; Kinect cameras; MST-AOG representation; action representation; appearance modeling; compositional model; cross-view action learning; cross-view action modeling; cross-view action recognition; cross-view actions; geometry modeling; hierarchical combinatorial structures; motion variation modeling; multiview action 3D dataset; multiview spatio-temporal AND-OR graph representation; video-based action recognition; Joints; Pattern recognition; Solid modeling; Three-dimensional displays; Training; Training data;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
DOI :
10.1109/CVPR.2014.339