Title :
An improved Monte Carlo POMDPs online planning algorithm combined with RAVE heuristic
Author :
Peigen Liu;Jing Chen;Hongfu Liu
Author_Institution :
College of Mechatronic Engineering and Automation, National University of Defense Technology, Changsha, Hunan Province, China
Abstract :
Partially observable Markov decision processes (POMDPs) provide a kind of general model that can deal with problems in uncertain environment efficiently. There are many different planning methods for POMDPs model. Partially Observable Monte Carlo planning (POMCP) method, using Monte Carlo Tree Search (MCTS) method, which can help break the curse of dimensionality and the curse of history. However, the method has strong dependence on the count of simulations. The POMCP algorithm was improved in this paper by combining Rapid Action Value Estimate (RAVE) method and MCTS. There´s less dependence on the count of simulations and higher efficiency in the improved algorithm, which is a promising online planning algorithm. Experimental results on the benchmark problems indicate that efficiency of the improved algorithm is higher than the basic POMCP algorithm.
Keywords :
"Planning","History","Monte Carlo methods","Algorithm design and analysis","Learning (artificial intelligence)","Computers","Markov processes"
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
Print_ISBN :
978-1-4799-8352-0
Electronic_ISBN :
2327-0594
DOI :
10.1109/ICSESS.2015.7339109