مرکز منطقه ای اطلاع رساني علوم و فناوري - An improved Monte Carlo POMDPs online planning algorithm combined with RAVE heuristic

DocumentCode :

3699181

Title :

An improved Monte Carlo POMDPs online planning algorithm combined with RAVE heuristic

Author :

Peigen Liu;Jing Chen;Hongfu Liu

Author_Institution :

College of Mechatronic Engineering and Automation, National University of Defense Technology, Changsha, Hunan Province, China

fYear :

2015

Firstpage :

511

Lastpage :

515

Abstract :

Partially observable Markov decision processes (POMDPs) provide a kind of general model that can deal with problems in uncertain environment efficiently. There are many different planning methods for POMDPs model. Partially Observable Monte Carlo planning (POMCP) method, using Monte Carlo Tree Search (MCTS) method, which can help break the curse of dimensionality and the curse of history. However, the method has strong dependence on the count of simulations. The POMCP algorithm was improved in this paper by combining Rapid Action Value Estimate (RAVE) method and MCTS. There´s less dependence on the count of simulations and higher efficiency in the improved algorithm, which is a promising online planning algorithm. Experimental results on the benchmark problems indicate that efficiency of the improved algorithm is higher than the basic POMCP algorithm.

Keywords :

"Planning","History","Monte Carlo methods","Algorithm design and analysis","Learning (artificial intelligence)","Computers","Markov processes"

Publisher :

ieee

Conference_Titel :

Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on

ISSN :

2327-0586

Print_ISBN :

978-1-4799-8352-0

Electronic_ISBN :

2327-0594

Type :

conf

DOI :

10.1109/ICSESS.2015.7339109

Filename :

7339109

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3699181