مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-task learning of system dynamics with maximum information gain

DocumentCode :

2693975

Title :

Multi-task learning of system dynamics with maximum information gain

Author :

Zubizarreta-Rodriguez, Jose F. ; Ramos, Fabio

Author_Institution :

Australian Centre for Field Robot., Univ. of Sydney, Sydney, NSW, Australia

fYear :

2011

fDate :

9-13 May 2011

Firstpage :

5709

Lastpage :

5715

Abstract :

This paper introduces a new approach to adoptively learn the dynamics of a robotic system. The methodology is based on maximizing the information gain from new observations while modeling the dynamics with a Multiple Output Gaussian Process (MOGP). High-dimensional state action spaces with unknown dependencies between inputs and outputs can be highly computationally expensive to learn. Gaussian process modeling is a Bayesian technique that naturally overcomes one of the most difficult problems in machine learning known as over-fitting. This makes it very appealing for on-line problems where testing multiple hypothesis is difficult. The computational cost of the learning task is reduced by having a smaller dataset of informative training points. Therefore we introduce a learning strategy capable of determining the most informative training set for the MOGP. This method can be implemented for learning the behavior of dynamic systems where due to their complexity and disturbances are infeasible to be analytically defined. The benefits of our approach are verified in two experiments: learning the dynamics of a cart pole system in simulation and the dynamics of a robotic blimp.

Keywords :

Bayes methods; Gaussian processes; aerospace robotics; airships; learning (artificial intelligence); robot dynamics; Bayesian technique; MOGP; cart pole system; computational cost reduction; high-dimensional state action space; informative training set; machine learning; maximum information gain; multiple output Gaussian process; multitask learning; over fitting; robotic blimp; robotic system; system dynamics; Covariance matrix; Heuristic algorithms; Kernel; Mathematical model; Robots; Training; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation (ICRA), 2011 IEEE International Conference on

Conference_Location :

Shanghai

ISSN :

1050-4729

Print_ISBN :

978-1-61284-386-5

Type :

conf

DOI :

10.1109/ICRA.2011.5979961

Filename :

5979961

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2693975