Title :
Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities
Author :
Ryoo, M.S. ; Aggarwal, J.K.
Author_Institution :
Robot Res. Dept., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
fDate :
Sept. 29 2009-Oct. 2 2009
Abstract :
Human activity recognition is a challenging task, especially when its background is unknown or changing, and when scale or illumination differs in each video. Approaches utilizing spatio-temporal local features have proved that they are able to cope with such difficulties, but they mainly focused on classifying short videos of simple periodic actions. In this paper, we present a new activity recognition methodology that overcomes the limitations of the previous approaches using local features. We introduce a novel matching, spatio-temporal relationship match, which is designed to measure structural similarity between sets of features extracted from two videos. Our match hierarchically considers spatio-temporal relationships among feature points, thereby enabling detection and localization of complex non-periodic activities. In contrast to previous approaches to `classify´ videos, our approach is designed to `detect and localize´ all occurring activities from continuous videos where multiple actors and pedestrians are present. We implement and test our methodology on a newly-introduced dataset containing videos of multiple interacting persons and individual pedestrians. The results confirm that our system is able to recognize complex non-periodic activities (e.g. `push´ and `hug´) from sets of spatio-temporal features even when multiple activities are present in the scene.
Keywords :
video signal processing; complex human activities recognition; complex nonperiodic activities; human activity recognition; individual pedestrians; multiple interacting persons; spatio-temporal features; spatio-temporal local features; spatio-temporal relationship; structural similarity; video structure comparison; Cameras; Computer vision; Feature extraction; Humans; Intelligent robots; Layout; Lighting; Robot vision systems; Surveillance; System testing;
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
DOI :
10.1109/ICCV.2009.5459361