Title :
Interactive Phrases: Semantic Descriptionsfor Human Interaction Recognition
Author :
Yu Kong ; Yunde Jia ; Yun Fu
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
Abstract :
This paper addresses the problem of recognizing human interactions from videos. We propose a novel approach that recognizes human interactions by the learned high-level descriptions, interactive phrases. Interactive phrases describe motion relationships between interacting people. These phrases naturally exploit human knowledge and allow us to construct a more descriptive model for recognizing human interactions. We propose a discriminative model to encode interactive phrases based on the latent SVM formulation. Interactive phrases are treated as latent variables and are used as mid-level features. To complement manually specified interactive phrases, we also discover data-driven phrases from data in order to find potentially useful and discriminative phrases for differentiating human interactions. An information-theoretic approach is employed to learn the data-driven phrases. The interdependencies between interactive phrases are explicitly captured in the model to deal with motion ambiguity and partial occlusion in the interactions. We evaluate our method on the BIT-Interaction data set, UT-Interaction data set, and Collective Activity data set. Experimental results show that our approach achieves superior performance over previous approaches.
Keywords :
gesture recognition; support vector machines; video signal processing; BIT-interaction data set; UT-interaction data set; collective activity data set; data-driven phrases; descriptive model; discriminative phrases; human interaction recognition; human knowledge; information-theoretic approach; interacting people; interactive phrase encoding; latent SVM formulation; latent variables; learned high-level descriptions; mid-level features; motion ambiguity; motion relationships; partial occlusion; specified interactive phrases; training video; Feature extraction; Hidden Markov models; Semantics; Torso; Training; Vectors; Videos; Human interaction; action recognition; latent structural SVM;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2014.2303090