• DocumentCode
    2082980
  • Title

    Understanding manipulation in video

  • Author

    Brand, Matthew

  • Author_Institution
    Media Lab., MIT, Cambridge, MA, USA
  • fYear
    1996
  • fDate
    14-16 Oct 1996
  • Firstpage
    94
  • Lastpage
    99
  • Abstract
    Manipulations are a significant subset of human gestures that are distinguished by the fact that their logic and meaning are particularly clear, being heavily constrained by physical causality. We present techniques and causal semantics for interpreting video of manipulation tasks such as disassembly. Psychologically-based causal constraints are used to detect meaningful changes in the integrity and motions of foreground segmented blobs; a small causal model of manipulation is used to disambiguate and parse these into a coherent account of video´s action. The causal constraints are drawn from studies of infant perceptual development; as with infants, they precede and may possibly even bootstrap the ability to reliably segment still objects. Our implementation produces a script of the causal evolution of the scene-output that supports cartoon summary, automated editing, and higher-level reasoning
  • Keywords
    image segmentation; image sequences; motion estimation; spatial reasoning; automated editing; cartoon summary; causal semantics; disassembly; foreground segmented blobs; higher-level reasoning; infant perceptual development; manipulation understanding; physical causality; psychologically-based causal constraints; still objects; Assembly; CD-ROMs; Humans; Layout; Logic; Motion detection; Physics; Psychology; Surface reconstruction; Turning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Face and Gesture Recognition, 1996., Proceedings of the Second International Conference on
  • Conference_Location
    Killington, VT
  • Print_ISBN
    0-8186-7713-9
  • Type

    conf

  • DOI
    10.1109/AFGR.1996.557249
  • Filename
    557249