DocumentCode :
3699181
Title :
An improved Monte Carlo POMDPs online planning algorithm combined with RAVE heuristic
Author :
Peigen Liu;Jing Chen;Hongfu Liu
Author_Institution :
College of Mechatronic Engineering and Automation, National University of Defense Technology, Changsha, Hunan Province, China
fYear :
2015
Firstpage :
511
Lastpage :
515
Abstract :
Partially observable Markov decision processes (POMDPs) provide a kind of general model that can deal with problems in uncertain environment efficiently. There are many different planning methods for POMDPs model. Partially Observable Monte Carlo planning (POMCP) method, using Monte Carlo Tree Search (MCTS) method, which can help break the curse of dimensionality and the curse of history. However, the method has strong dependence on the count of simulations. The POMCP algorithm was improved in this paper by combining Rapid Action Value Estimate (RAVE) method and MCTS. There´s less dependence on the count of simulations and higher efficiency in the improved algorithm, which is a promising online planning algorithm. Experimental results on the benchmark problems indicate that efficiency of the improved algorithm is higher than the basic POMCP algorithm.
Keywords :
"Planning","History","Monte Carlo methods","Algorithm design and analysis","Learning (artificial intelligence)","Computers","Markov processes"
Publisher :
ieee
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
ISSN :
2327-0586
Print_ISBN :
978-1-4799-8352-0
Electronic_ISBN :
2327-0594
Type :
conf
DOI :
10.1109/ICSESS.2015.7339109
Filename :
7339109
Link To Document :
بازگشت