DocumentCode :
1644106
Title :
Expectation grammars: leveraging high-level expectations for activity recognition
Author :
Minnen, David ; Essa, Irfan ; Starner, Thad
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
2
fYear :
2003
Abstract :
Video-based recognition and prediction of a temporally extended activity can benefit from a detailed description of high-level expectations about the activity. Stochastic grammars allow for an efficient representation of such expectations and are well-suited for the specification of temporally well-ordered activities. In this paper, we extend stochastic grammars by adding event parameters, state checks, and sensitivity to an internal scene model. We present an implemented system that uses human-specified grammars to recognize a person performing the Towers of Hanoi task from a video sequence by analyzing object interaction events. Experimental results from several videos show robust recognition of the full task and its constituent sub-tasks even though no appearance models of the objects in the video are provided. These experiments include videos of the task performed with different shaped objects and with distracting and extraneous interactions.
Keywords :
computer vision; context-sensitive grammars; image motion analysis; image sequences; object detection; sensitivity; stereo image processing; stochastic processes; Towers of Hanoi; activity prediction; activity recognition; activity specification; appearance model; distracting interaction; event parameter; expectation grammar; expectation representation; extraneous interaction; high-level expectation; human-specified grammar; internal scene model; object interaction event analysis; person recognition; robust recognition; sensitivity; shaped object; state check; stochastic grammar; temporally extended activity; temporally well-ordered activity; video object; video sequence; video-based recognition; Educational institutions; Hidden Markov models; Humans; Layout; Performance analysis; Poles and towers; Robustness; Stochastic processes; Surveillance; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE Computer Society Conference on
ISSN :
1063-6919
Print_ISBN :
0-7695-1900-8
Type :
conf
DOI :
10.1109/CVPR.2003.1211525
Filename :
1211525
Link To Document :
بازگشت