Title :
Multi-task learning of system dynamics with maximum information gain
Author :
Zubizarreta-Rodriguez, Jose F. ; Ramos, Fabio
Author_Institution :
Australian Centre for Field Robot., Univ. of Sydney, Sydney, NSW, Australia
Abstract :
This paper introduces a new approach to adoptively learn the dynamics of a robotic system. The methodology is based on maximizing the information gain from new observations while modeling the dynamics with a Multiple Output Gaussian Process (MOGP). High-dimensional state action spaces with unknown dependencies between inputs and outputs can be highly computationally expensive to learn. Gaussian process modeling is a Bayesian technique that naturally overcomes one of the most difficult problems in machine learning known as over-fitting. This makes it very appealing for on-line problems where testing multiple hypothesis is difficult. The computational cost of the learning task is reduced by having a smaller dataset of informative training points. Therefore we introduce a learning strategy capable of determining the most informative training set for the MOGP. This method can be implemented for learning the behavior of dynamic systems where due to their complexity and disturbances are infeasible to be analytically defined. The benefits of our approach are verified in two experiments: learning the dynamics of a cart pole system in simulation and the dynamics of a robotic blimp.
Keywords :
Bayes methods; Gaussian processes; aerospace robotics; airships; learning (artificial intelligence); robot dynamics; Bayesian technique; MOGP; cart pole system; computational cost reduction; high-dimensional state action space; informative training set; machine learning; maximum information gain; multiple output Gaussian process; multitask learning; over fitting; robotic blimp; robotic system; system dynamics; Covariance matrix; Heuristic algorithms; Kernel; Mathematical model; Robots; Training; Training data;
Conference_Titel :
Robotics and Automation (ICRA), 2011 IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-386-5
DOI :
10.1109/ICRA.2011.5979961