Title :
Point cloud video object segmentation using a persistent supervoxel world-model
Author :
Papon, Jeremie ; Kulvicius, Tomas ; Aksoy, Eren Erdal ; Worgotter, Florentin
Author_Institution :
Bernstein Center for Comput. Neurosci. (BCCN), Georg-August Univ. of Gottingen, Göttingen, Germany
Abstract :
Robust visual tracking is an essential precursor to understanding and replicating human actions in robotic systems. In order to accurately evaluate the semantic meaning of a sequence of video frames, or to replicate an action contained therein, one must be able to coherently track and segment all observed agents and objects. This work proposes a novel online point cloud based algorithm which simultaneously tracks 6DoF pose and determines spatial extent of all entities in indoor scenarios. This is accomplished using a persistent supervoxel world-model which is updated, rather than replaced, as new frames of data arrive. Maintenance of a world model enables general object permanence, permitting successful tracking through full occlusions. Object models are tracked using a bank of independent adaptive particle filters which use a supervoxel observation model to give rough estimates of object state. These are united using a novel multi-model RANSAC-like approach, which seeks to minimize a global energy function associating world-model supervoxels to predicted states. We present results on a standard robotic assembly benchmark for two application scenarios - human trajectory imitation and semantic action understanding - demonstrating the usefulness of the tracking in intelligent robotic systems.
Keywords :
adaptive filters; image segmentation; image sequences; intelligent robots; iterative methods; object tracking; particle filtering (numerical methods); pose estimation; robotic assembly; video signal processing; 6DoF pose estimation; full occlusions; general object permanence; global energy function; human actions; human trajectory imitation; independent adaptive particle filters; intelligent robotic systems; novel multimodel RANSAC-like approach; novel online point cloud based algorithm; object tracking; persistent supervoxel world-model; point cloud video object segmentation; robotic systems; robust visual tracking; semantic action understanding; spatial extent; standard robotic assembly benchmark; supervoxel observation model; video frame sequence; Image segmentation; Octrees; Robots; Target tracking; Three-dimensional displays; Trajectory; Visualization;
Conference_Titel :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
DOI :
10.1109/IROS.2013.6696886