DocumentCode
264847
Title
Simulated Annealing Monte Carlo Tree Search for Large POMDPs
Author
Kai Xiong ; Hong Jiang
Author_Institution
Southwest Univ. of Sci. & Technol., Mianyang, China
Volume
1
fYear
2014
fDate
26-27 Aug. 2014
Firstpage
140
Lastpage
143
Abstract
Many planning and control problems can be modeled as large POMDPs, but very few can be solved scalably because of their computational complexity. This paper proposes a Simulated Annealing based on the Monte Carlo Tree Search for large POMDPs. The proposed algorithm determines an acceptance probability of sampling a back-propagation´s outcome in the simulated annealing process. The experiments show that the proposed SAMCTS (Simulated Annealing Monte Carlo Tree Search) outperforms the original Simulated Annealing algorithm when applied to a large POMDP benchmark problem.
Keywords
Markov processes; Monte Carlo methods; backpropagation; computational complexity; multi-agent systems; probability; simulated annealing; tree searching; POMDP benchmark problem; acceptance probability; agents; back-propagation outcome; computational complexity; partially observable Markov decision process; simulated annealing Monte Carlo tree search; Games; Markov processes; Monte Carlo methods; Planning; Rocks; Search problems; Simulated annealing; MCTS; POMDPs; Simulated Annealing;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2014 Sixth International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4799-4956-4
Type
conf
DOI
10.1109/IHMSC.2014.42
Filename
6917325
Link To Document