مرکز منطقه ای اطلاع رساني علوم و فناوري - A review of inverse reinforcement learning theory and recent advances

DocumentCode :

2821472

Title :

A review of inverse reinforcement learning theory and recent advances

Author :

Zhifei, Shao ; Joo, Er Meng

Author_Institution :

Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore

fYear :

2012

fDate :

10-15 June 2012

Firstpage :

Lastpage :

Abstract :

A major challenge faced by machine learning community is the decision making problems under uncertainty. Reinforcement Learning (RL) techniques provide a powerful solution for it. An agent used by RL interacts with a dynamic environment and finds a policy through a reward function, without using target labels like Supervised Learning (SL). However, one fundamental assumption of existing RL algorithms is that reward function, the most succinct representation of the designer´s intention, needs to be provided beforehand. In practice, the reward function can be very hard to specify and exhaustive to tune for large and complex problems, and this inspires the development of Inverse Reinforcement Learning (IRL), an extension of RL, which directly tackles this problem by learning the reward function through expert demonstrations. IRL introduces a new way of learning policies by deriving expert´s intentions, in contrast to directly learning policies, which can be redundant and have poor generalization ability. In this paper, the original IRL algorithms and its close variants, as well as their recent advances are reviewed and compared.

Keywords :

expert systems; learning (artificial intelligence); multi-agent systems; IRL techniques; SL; agent; decision making problems; dynamic environment; expert demonstrations; expert intentions; inverse reinforcement learning theory; machine learning community; reward function; supervised learning; target labels; Educational institutions; Helicopters; Learning; Optimization; Prediction algorithms; Robots; Trajectory; Reinforcement learning; expert demonstration; inverse reinforcement learning; reward function;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Evolutionary Computation (CEC), 2012 IEEE Congress on

Conference_Location :

Brisbane, QLD

Print_ISBN :

978-1-4673-1510-4

Electronic_ISBN :

978-1-4673-1508-1

Type :

conf

DOI :

10.1109/CEC.2012.6256507

Filename :

6256507

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2821472