مرکز منطقه ای اطلاع رساني علوم و فناوري - Road scene labeling using SfM module and 3D bag of textons

DocumentCode :

2266312

Title :

Road scene labeling using SfM module and 3D bag of textons

Author :

Kang, Yousun ; Yamaguchi, Koichiro ; Naito, Takashi ; Ninomiya, Yoshiki

Author_Institution :

Toyota Central R&D Labs., Inc., Nagakute, Japan

fYear :

2009

fDate :

Sept. 27 2009-Oct. 4 2009

Firstpage :

657

Lastpage :

664

Abstract :

Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be taken into account. This paper presents a new integration framework using an SfM module and a bag of textons method for road scene labeling. By using a multi-band image, which consists of a near-infrared and a visible color image, we can generate better discriminative textons than those generated by using only a color image. Our SfM module can accurately estimate the ego motion of the vehicle and reconstruct a 3D structure of the road scene. The bag of textons is computed over local rectangular regions: its size depends on the distance of the textons. Therefore, the 3D bag of textons method can help to effectively recognize the objects of a road scene because it considers the object´s 3D structure. For solving the labeling problem, we employ a pairwise conditional random field (CRF) model. The unary potential of the CRF model is affected by SfM results, and the pairwise potential is optimized by the multi-band image intensity. Experimental results show that the proposed method can effectively classify the objects in a 2D road scene with 3D structures. The proposed system can revolutionize 3D scene understanding systems used for vehicle environment perception.

Keywords :

image colour analysis; image segmentation; 2D semantic segmentation; 3D bag of textons; 3D spatial structure; 3D structure; SfM module; appearance-based segmentation; discriminative textons; integration framework; multiband image intensity; near-infrared image; pairwise conditional random field model; road scene labeling; structure from motion; visible color image; Labeling; Layout;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-4442-7

Electronic_ISBN :

978-1-4244-4441-0

Type :

conf

DOI :

10.1109/ICCVW.2009.5457641

Filename :

5457641

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2266312