Semantic structure from motion with points, regions, and objects

Author

Bao, Sid Yingze ; Bagra, Mohit ; Chao, Yu-Wei ; Savarese, Silvio

Author_Institution

Dept. of Electr. & Comput. Eng., Univ. of Michigan at Ann Arbor, Ann Arbor, MI, USA

fYear

2012

fDate

16-21 June 2012

Firstpage

2703

Lastpage

2710

Abstract

Structure from motion (SFM) aims at jointly recovering the structure of a scene as a collection of 3D points and estimating the camera poses from a number of input images. In this paper we generalize this concept: not only do we want to recover 3D points, but also recognize and estimate the location of high level semantic scene components such as regions and objects in 3D. As a key ingredient for this joint inference problem, we seek to model various types of interactions between scene components. Such interactions help regularize our solution and obtain more accurate results than solving these problems in isolation. Experiments on public datasets demonstrate that: 1) our framework estimates camera poses more robustly than SFM algorithms that use points only; 2) our framework is capable of accurately estimating pose and location of objects, regions, and points in the 3D scene; 3) our framework recognizes objects and regions more accurately than state-of-the-art single image recognition methods.

Keywords

cameras; object recognition; pose estimation; 3D point collection; 3D scene; SFM algorithms; camera pose estimation; high level semantic scene components; image recognition methods; joint inference problem; object location; object points; object recognition; object region; region recognition; scene structure recovery; semantic structure from motion; Cameras; Energy measurement; Estimation; Geometry; Roads; Robustness; Semantics;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location

Providence, RI

ISSN

1063-6919

Print_ISBN

978-1-4673-1226-4

Electronic_ISBN

1063-6919

Type

conf

DOI

10.1109/CVPR.2012.6247992

Filename

6247992