Title :
Discriminative Apprenticeship Learning with Both Preference and Non-preference Behavior
Author :
Dingsheng Luo ; Yi Wang ; Xihong Wu
Author_Institution :
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
Abstract :
Considering that expert´s demonstrations are usually sub optimal and failed demonstrations often have some useful guidance, in this paper, a Discriminative Apprenticeship Learning algorithm is proposed, where the apprentice is taught with the join of failed attempts to acquire the ability that could discriminate the preference and non-preference cases so that to actively take a corresponding action. Since robot usually encounters changing environments, generalization ability is taken into account in the algorithm through which the reward function is recovered under the evaluation of generalization error. The problem of the representation error is also analyzed and involved in the algorithm. To ensure performance of the algorithm, theoretical guarantee is presented. Experiments on a simple car-driving robot and the comparison with a variety of inverse reinforcement learning methods are performed, which illustrate the proposed method is an effective and promising alternative.
Keywords :
learning (artificial intelligence); robots; car-driving robot; changing environments; discriminative apprenticeship learning; generalization ability; generalization error; nonpreference behavior; preference behavior; reinforcement learning methods; reward function; Indium phosphide; Learning (artificial intelligence); Robots; Training; Training data; Trajectory; Vectors; Discriminative apprenticeship learning; Inverse reinforcement learning; Preference and non-preference;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
DOI :
10.1109/ICMLA.2013.64