مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

67259

Title :

Learning to Segment and Track in RGBD

Author :

Teichman, Alex ; Lussier, Jake T. ; Thrun, Sebastian

Author_Institution :

Stanford Univ., Stanford, CA, USA

Volume :

Issue :

fYear :

2013

fDate :

Oct. 2013

Firstpage :

841

Lastpage :

852

Abstract :

We consider the problem of segmenting and tracking deformable objects in color video with depth (RGBD) data available from commodity sensors such as the Asus Xtion Pro Live or Microsoft Kinect. We frame this problem with very few assumptions-no prior object model, no stationary sensor, and no prior 3-D map-thus making a solution potentially useful for a large number of applications, including semi-supervised learning, 3-D model capture, and object recognition. Our approach makes use of a rich feature set, including local image appearance, depth discontinuities, optical flow, and surface normals to inform the segmentation decision in a conditional random field model. In contrast to previous work in this field, the proposed method learns how to best make use of these features from ground-truth segmented sequences. We provide qualitative and quantitative analyses which demonstrate substantial improvement over the state of the art. This paper is an extended version of our previous work. Building on our previous work, we show that it is possible to achieve an order of magnitude speedup and thus real-time performance ( ~ 20 FPS) on a laptop computer by applying simple algorithmic optimizations to the original work. This speedup comes at only a minor cost in overall accuracy and thus makes this approach applicable to a broader range of tasks. We demonstrate one such task: real-time, online, interactive segmentation to efficiently collect training data for an off-the-shelf object detector.

Keywords :

image colour analysis; image segmentation; image sequences; learning (artificial intelligence); object detection; object recognition; object tracking; solid modelling; statistical analysis; video signal processing; 3D model capture; Asus Xtion Pro Live; Microsoft Kinect; RGBD; algorithmic optimizations; color video-with-depth; conditional random field model; deformable object segmentation; deformable object tracking; depth discontinuities; interactive segmentation; laptop computer; local image appearance; magnitude speedup; object recognition; off-the-shelf object detector; optical flow; semi-supervised learning; surface normals; Image edge detection; Image segmentation; Machine vision; Object recognition; Object segmentation; Sensors; Image segmentation; machine vision; object recognition; object segmentation;

fLanguage :

English

Journal_Title :

Automation Science and Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1545-5955

Type :

jour

DOI :

10.1109/TASE.2013.2264286

Filename :

6573398

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=67259