DocumentCode :
264847
Title :
Simulated Annealing Monte Carlo Tree Search for Large POMDPs
Author :
Kai Xiong ; Hong Jiang
Author_Institution :
Southwest Univ. of Sci. & Technol., Mianyang, China
Volume :
1
fYear :
2014
fDate :
26-27 Aug. 2014
Firstpage :
140
Lastpage :
143
Abstract :
Many planning and control problems can be modeled as large POMDPs, but very few can be solved scalably because of their computational complexity. This paper proposes a Simulated Annealing based on the Monte Carlo Tree Search for large POMDPs. The proposed algorithm determines an acceptance probability of sampling a back-propagation´s outcome in the simulated annealing process. The experiments show that the proposed SAMCTS (Simulated Annealing Monte Carlo Tree Search) outperforms the original Simulated Annealing algorithm when applied to a large POMDP benchmark problem.
Keywords :
Markov processes; Monte Carlo methods; backpropagation; computational complexity; multi-agent systems; probability; simulated annealing; tree searching; POMDP benchmark problem; acceptance probability; agents; back-propagation outcome; computational complexity; partially observable Markov decision process; simulated annealing Monte Carlo tree search; Games; Markov processes; Monte Carlo methods; Planning; Rocks; Search problems; Simulated annealing; MCTS; POMDPs; Simulated Annealing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2014 Sixth International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-4956-4
Type :
conf
DOI :
10.1109/IHMSC.2014.42
Filename :
6917325
Link To Document :
بازگشت