DocumentCode :
663585
Title :
Multimodal integration learning of object manipulation behaviors using deep neural networks
Author :
Noda, Kentaro ; Arie, Hiroaki ; Suga, Yuji ; Ogata, Takaaki
Author_Institution :
Grad. Sch. of Fundamental Sci. & Eng., Waseda Univ., Tokyo, Japan
fYear :
2013
fDate :
3-7 Nov. 2013
Firstpage :
1728
Lastpage :
1733
Abstract :
This paper presents a novel computational approach for modeling and generating multiple object manipulation behaviors by a humanoid robot. The contribution of this paper is that deep learning methods are applied not only for multimodal sensor fusion but also for sensory-motor coordination. More specifically, a time-delay deep neural network is applied for modeling multiple behavior patterns represented with multi-dimensional visuomotor temporal sequences. By using the efficient training performance of Hessian-free optimization, the proposed mechanism successfully models six different object manipulation behaviors in a single network. The generalization capability of the learning mechanism enables the acquired model to perform the functions of cross-modal memory retrieval and temporal sequence prediction. The experimental results show that the motion patterns for object manipulation behaviors are successfully generated from the corresponding image sequence, and vice versa. Moreover, the temporal sequence prediction enables the robot to interactively switch multiple behaviors in accordance with changes in the displayed objects.
Keywords :
delay systems; generalisation (artificial intelligence); humanoid robots; image sequences; intelligent robots; learning systems; manipulators; neurocontrollers; optimisation; prediction theory; robot vision; sensor fusion; Hessian-free optimization; computational approach; cross-modal memory retrieval; generalization capability; humanoid robot; image sequence; multidimensional visuomotor temporal sequences; multimodal integration learning; multimodal sensor fusion; multiple behavior pattern modeling; object manipulation behaviors; sensory-motor coordination; temporal sequence prediction; time-delay deep neural network; training performance; Image reconstruction; Image sequences; Joints; Neural networks; Robot sensing systems; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
ISSN :
2153-0858
Type :
conf
DOI :
10.1109/IROS.2013.6696582
Filename :
6696582
Link To Document :
بازگشت