Understanding manipulation in video

Author

Brand, Matthew

Author_Institution

Media Lab., MIT, Cambridge, MA, USA

fYear

1996

fDate

14-16 Oct 1996

Firstpage

94

Lastpage

99

Abstract

Manipulations are a significant subset of human gestures that are distinguished by the fact that their logic and meaning are particularly clear, being heavily constrained by physical causality. We present techniques and causal semantics for interpreting video of manipulation tasks such as disassembly. Psychologically-based causal constraints are used to detect meaningful changes in the integrity and motions of foreground segmented blobs; a small causal model of manipulation is used to disambiguate and parse these into a coherent account of video´s action. The causal constraints are drawn from studies of infant perceptual development; as with infants, they precede and may possibly even bootstrap the ability to reliably segment still objects. Our implementation produces a script of the causal evolution of the scene-output that supports cartoon summary, automated editing, and higher-level reasoning

Keywords

image segmentation; image sequences; motion estimation; spatial reasoning; automated editing; cartoon summary; causal semantics; disassembly; foreground segmented blobs; higher-level reasoning; infant perceptual development; manipulation understanding; physical causality; psychologically-based causal constraints; still objects; Assembly; CD-ROMs; Humans; Layout; Logic; Motion detection; Physics; Psychology; Surface reconstruction; Turning;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Face and Gesture Recognition, 1996., Proceedings of the Second International Conference on

Conference_Location

Killington, VT

Print_ISBN

0-8186-7713-9

Type

conf

DOI

10.1109/AFGR.1996.557249

Filename

557249