DocumentCode
2473071
Title
Adaptive state aggregation for reinforcement learning
Author
Hwang, Kao-Shing ; Chen, Yu-Jen ; Jiang, Wei-Cheng
Author_Institution
Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
fYear
2012
fDate
14-17 Oct. 2012
Firstpage
2452
Lastpage
2456
Abstract
State partition is an important issue in reinforcement learning, because it has a significant effect on the performance. In this paper, an adaptive state partition method is presented for discretizing the state space adaptively and makes use of decision trees effectively. The proposed method splits the state space according to the temporal difference generated by the reinforcement learning. Consequently, the reinforcement learning uses the state space partitioned by the decision tree to learn the policy simultaneously. For avoiding a trivial partition, sibling nodes are pruned according to the Activity and the Reliability. A Monte-Carlo Tree Search (MCTS) is also proposed to explore the policy. A simulation for approaching goal has been conducted to demonstrate that the proposed method can achieve the design goal.
Keywords
Monte Carlo methods; decision trees; learning (artificial intelligence); state-space methods; tree searching; MCTS; Monte-Carlo tree search; adaptive state aggregation; adaptive state partition method; decision trees; reinforcement learning; reliability; sibling nodes; state space; temporal difference; trivial partition; Decision trees; Estimation error; Learning; Markov processes; Monte Carlo methods; Reliability; Vectors; MCTS; decision tree; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4673-1713-9
Electronic_ISBN
978-1-4673-1712-2
Type
conf
DOI
10.1109/ICSMC.2012.6378111
Filename
6378111
Link To Document