DocumentCode
1592744
Title
Preparing various policies for interactive reinforcement learning for the SICE-ICASE International Joint Conference 2006 (SICE-ICCAS 2006)
Author
Satoh, Kazuhiro ; Yamaguchi, Tomohiro
Author_Institution
Fac. of Adv. Eng., Nara Nat. Coll. of Technol.
fYear
2006
Firstpage
2440
Lastpage
2444
Abstract
We propose a new method of preparing various policies in order to distinguish main rewards from temporal rewards, aimed at interactive reinforcement learning in which reward functions are given incrementally from an initial state to the goal state. Shaping is the theoretical framework of interactive reinforcement learning. Most previous research on shaping assumes a shaping reward function that is a monotonic distance function to the main goal and that is policy invariant. However, these assumptions do not hold in interactive reinforcement learning. To address this, it is necessary to distinguish main rewards, which are included in an expected optimal policy, from temporal rewards, which serve only to guide learning toward the optimal policy. This paper proposes a reward discrimination method for an interactive reinforcement learning agent. First, we introduce the concept of every-visit-optimality to define various policies. Then we present a method for searching for various policies on an identified MDP model. Experiments comparing modified-PIA with our method evaluate the total search cost of acquiring various policies. The experimental results show that our method holds the total search cost steady as the number of rewards increases. This suggests that our method is better suited than previous reinforcement learning methods to interactive reinforcement learning in which many rewards are added incrementally.
Keywords
interactive systems; learning (artificial intelligence); robots; search problems; every-visit-optimality concept; expected optimal policy; interactive reinforcement learning agent; reward discrimination method; reward function; Interactive; Reinforcement Learning; ev-optimality
fLanguage
English
Publisher
IEEE
Conference_Titel
SICE-ICASE, 2006. International Joint Conference
Conference_Location
Busan
Print_ISBN
89-950038-4-7
Electronic_ISBN
89-950038-5-5
Type
conf
DOI
10.1109/SICE.2006.315139
Filename
4108051