Title :
Deep learning of spatio-temporal features with geometric-based moving point detection for motion segmentation
Author :
Tsung-Han Lin ; Chieh-Chih Wang
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fDate :
May 31 2014-June 7 2014
Abstract :
This paper introduces an approach to accomplish motion segmentation from a moving stereo camera based on deep learning. Previous work on moving object detection mostly use point features based on 3D geometric constraints. However, point features require good features, and are hard to detect or to be matched correctly in situations where objects have smooth textures. To alleviate this problem, learning high-level spatio-temporal features unsupervisedly from raw image data based on Reconstruction Independent Component Analysis (RICA) autoencoders is proposed. Despite the power of the new spatio-temporal features, these features cannot not learn and be used to interpret 3D geometry of dynamic scenes, which is critical for moving object detection from moving cameras. As detected moving points based on 3D geometric constraints still contain valuable information of 3D scene as well as the camera egomotion, we propose a framework that incorporates both the detected moving point results and the learned spatio-temporal features as inputs to Recursive Neural Networks (RNN) that performs motion segmentation. Both features effectively complement each other. The proposed approach is demonstrated with real-world stereo video data that contains multiple moving objects, and has achieved 26% better detection rate over the existing 3D geometric-based moving points detector.
Keywords :
feature extraction; image motion analysis; image segmentation; independent component analysis; learning (artificial intelligence); neural nets; object detection; stereo image processing; traffic engineering computing; video signal processing; 3D geometric constraints; 3D geometric-based moving point detector; 3D scene; RICA autoencoders; camera egomotion; deep spatiotemporal feature learning; geometric-based moving point detection; motion segmentation; moving object detection; moving stereo camera; point feature detection; real-world stereo video data; reconstruction independent component analysis; recursive neural networks; Cameras; Computer vision; Feature extraction; Image edge detection; Image segmentation; Motion segmentation; Three-dimensional displays;
Conference_Titel :
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location :
Hong Kong
DOI :
10.1109/ICRA.2014.6907299