Title :
Recognizing and Localizing Individual Activities through Graph Matching
Author :
Ta, Anh-Phuong ; Wolf, Christian ; Lavoue, Guillaume ; Baskurt, Atilla
Author_Institution :
INSA-Lyon, Univ. de Lyon, Lyon, France
fDate :
Aug. 29 2010-Sept. 1 2010
Abstract :
In this paper we tackle the problem of detecting individual human actions in video sequences. While the most successful methods are based on local features, which proved that they can deal with changes in background, scale and illumination, most existing methods have two main shortcomings: first, they are mainly based on the individual power of spatio-temporal interest points (STIP), and therefore ignore the spatio-temporal relationships between them. Second, these methods mainly focus on direct classification techniques to classify the human activities, as opposed to detection and localization. In order to overcome these limitations, we propose a new approach, which is based on a graph matching algorithm for activity recognition. In contrast to most previous methods which classify entire video sequences, we design a video matching method from two sets of ST-points for human activity recognition. First, points are extracted, and a hyper graphs are constructed from them, i.e. graphs with edges involving more than 2 nodes (3 in our case). The activity recognition problem is then transformed into a problem of finding instances of model graphs in the scene graph. By matching local features instead of classifying entire sequences, our method is able to detect multiple different activities which occur simultaneously in a video sequence. Experiments on two standard datasets demonstrate that our method is comparable to the existing techniques on classification, and that it can, additionally, detect and localize activities.
Keywords :
graph theory; image classification; image sequences; spatiotemporal phenomena; graph matching algorithm; human activity recognition; hyper graph; local features; spatio temporal interest point; video matching method; video sequences; Complexity theory; Computational modeling; Feature extraction; Humans; Mathematical model; Robustness; Video sequences;
Conference_Titel :
Advanced Video and Signal Based Surveillance (AVSS), 2010 Seventh IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-8310-5
DOI :
10.1109/AVSS.2010.81