مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning multiple behaviours using hierarchical clustering of rewards

DocumentCode :

3709737

Title :

Learning multiple behaviours using hierarchical clustering of rewards

Author :

Javier Almingol;Luis Montesano

Author_Institution :

Universidad de Zaragoza, Instituto de Investigació

fYear :

2015

Firstpage :

4608

Lastpage :

4613

Abstract :

Learning models of behaviours has many applications in robotics spanning both control, e.g. learning from demonstration and perception, e.g. monitoring and surveillance. Inverse reinforcement learning encodes behaviours as a reward function learned from a set of demonstrations. This paper addresses the problem of learning from unlabelled datasets containing an unknown number of behaviours in continuous action-state spaces. The proposed method uses a hierarchical clustering approach to directly group trajectories that share a common reward function. The similarity metric is based on the distribution of maximum entropy of the feature counts computed using path integrals. We evaluated the method in three different tasks: navigation on a set of synthetic maps, human driving styles on a simulator and human reaching. Results show that clustering in the reward space is able to discover the latent reward structure resulting in compact models that can generate all the observed behaviours.

Keywords :

"Trajectory","Cost function","Entropy","Clustering algorithms","Learning (artificial intelligence)","Space exploration","Aerospace electronics"

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on

Type :

conf

DOI :

10.1109/IROS.2015.7354033

Filename :

7354033

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3709737