مرکز منطقه ای اطلاع رساني علوم و فناوري - Deep learning of spatio-temporal features with geometric-based moving point detection for motion segmentation

DocumentCode :

250476

Title :

Deep learning of spatio-temporal features with geometric-based moving point detection for motion segmentation

Author :

Tsung-Han Lin ; Chieh-Chih Wang

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan

fYear :

2014

fDate :

May 31 2014-June 7 2014

Firstpage :

3058

Lastpage :

3065

Abstract :

This paper introduces an approach to accomplish motion segmentation from a moving stereo camera based on deep learning. Previous work on moving object detection mostly use point features based on 3D geometric constraints. However, point features require good features, and are hard to detect or to be matched correctly in situations where objects have smooth textures. To alleviate this problem, learning high-level spatio-temporal features unsupervisedly from raw image data based on Reconstruction Independent Component Analysis (RICA) autoencoders is proposed. Despite the power of the new spatio-temporal features, these features cannot not learn and be used to interpret 3D geometry of dynamic scenes, which is critical for moving object detection from moving cameras. As detected moving points based on 3D geometric constraints still contain valuable information of 3D scene as well as the camera egomotion, we propose a framework that incorporates both the detected moving point results and the learned spatio-temporal features as inputs to Recursive Neural Networks (RNN) that performs motion segmentation. Both features effectively complement each other. The proposed approach is demonstrated with real-world stereo video data that contains multiple moving objects, and has achieved 26% better detection rate over the existing 3D geometric-based moving points detector.

Keywords :

feature extraction; image motion analysis; image segmentation; independent component analysis; learning (artificial intelligence); neural nets; object detection; stereo image processing; traffic engineering computing; video signal processing; 3D geometric constraints; 3D geometric-based moving point detector; 3D scene; RICA autoencoders; camera egomotion; deep spatiotemporal feature learning; geometric-based moving point detection; motion segmentation; moving object detection; moving stereo camera; point feature detection; real-world stereo video data; reconstruction independent component analysis; recursive neural networks; Cameras; Computer vision; Feature extraction; Image edge detection; Image segmentation; Motion segmentation; Three-dimensional displays;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation (ICRA), 2014 IEEE International Conference on

Conference_Location :

Hong Kong

Type :

conf

DOI :

10.1109/ICRA.2014.6907299

Filename :

6907299

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=250476