• DocumentCode
    2501898
  • Title

    Recognizing and Localizing Individual Activities through Graph Matching

  • Author

    Ta, Anh-Phuong ; Wolf, Christian ; Lavoue, Guillaume ; Baskurt, Atilla

  • Author_Institution
    INSA-Lyon, Univ. de Lyon, Lyon, France
  • fYear
    2010
  • fDate
    Aug. 29 2010-Sept. 1 2010
  • Firstpage
    196
  • Lastpage
    203
  • Abstract
    In this paper we tackle the problem of detecting individual human actions in video sequences. While the most successful methods are based on local features, which proved that they can deal with changes in background, scale and illumination, most existing methods have two main shortcomings: first, they are mainly based on the individual power of spatio-temporal interest points (STIP), and therefore ignore the spatio-temporal relationships between them. Second, these methods mainly focus on direct classification techniques to classify the human activities, as opposed to detection and localization. In order to overcome these limitations, we propose a new approach, which is based on a graph matching algorithm for activity recognition. In contrast to most previous methods which classify entire video sequences, we design a video matching method from two sets of ST-points for human activity recognition. First, points are extracted, and a hyper graphs are constructed from them, i.e. graphs with edges involving more than 2 nodes (3 in our case). The activity recognition problem is then transformed into a problem of finding instances of model graphs in the scene graph. By matching local features instead of classifying entire sequences, our method is able to detect multiple different activities which occur simultaneously in a video sequence. Experiments on two standard datasets demonstrate that our method is comparable to the existing techniques on classification, and that it can, additionally, detect and localize activities.
  • Keywords
    graph theory; image classification; image sequences; spatiotemporal phenomena; graph matching algorithm; human activity recognition; hyper graph; local features; spatio temporal interest point; video matching method; video sequences; Complexity theory; Computational modeling; Feature extraction; Humans; Mathematical model; Robustness; Video sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Video and Signal Based Surveillance (AVSS), 2010 Seventh IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-8310-5
  • Type

    conf

  • DOI
    10.1109/AVSS.2010.81
  • Filename
    5597142