Adaptive state aggregation for reinforcement learning

Author

Hwang, Kao-Shing ; Chen, Yu-Jen ; Jiang, Wei-Cheng

Author_Institution

Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan

fYear

2012

fDate

14-17 Oct. 2012

Firstpage

2452

Lastpage

2456

Abstract

State partition is an important issue in reinforcement learning, because it has a significant effect on the performance. In this paper, an adaptive state partition method is presented for discretizing the state space adaptively and makes use of decision trees effectively. The proposed method splits the state space according to the temporal difference generated by the reinforcement learning. Consequently, the reinforcement learning uses the state space partitioned by the decision tree to learn the policy simultaneously. For avoiding a trivial partition, sibling nodes are pruned according to the Activity and the Reliability. A Monte-Carlo Tree Search (MCTS) is also proposed to explore the policy. A simulation for approaching goal has been conducted to demonstrate that the proposed method can achieve the design goal.

Keywords

Monte Carlo methods; decision trees; learning (artificial intelligence); state-space methods; tree searching; MCTS; Monte-Carlo tree search; adaptive state aggregation; adaptive state partition method; decision trees; reinforcement learning; reliability; sibling nodes; state space; temporal difference; trivial partition; Decision trees; Estimation error; Learning; Markov processes; Monte Carlo methods; Reliability; Vectors; MCTS; decision tree; reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on

Conference_Location

Seoul

Print_ISBN

978-1-4673-1713-9

Electronic_ISBN

978-1-4673-1712-2

Type

conf

DOI

10.1109/ICSMC.2012.6378111

Filename

6378111