مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimistic planning for sparsely stochastic systems

DocumentCode :

3642772

Title :

Optimistic planning for sparsely stochastic systems

Author :

Lucian Buşoniu;Rémi Munos;Bart De Schutter;Robert Babuška

Author_Institution :

Delft Center for Systems and Control, Delft University of Technology, the Netherlands

fYear :

2011

fDate :

4/1/2011 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

We propose an online planning algorithm for finite-action, sparsely stochastic Markov decision processes, in which the random state transitions can only end up in a small number of possible next states. The algorithm builds a planning tree by iteratively expanding states, where each expansion exploits sparsity to add all possible successor states. Each state to expand is actively chosen to improve the knowledge about action quality, and this allows the algorithm to return a good action after a strictly limited number of expansions. More specifically, the active selection method is optimistic in that it chooses the most promising states first, so the novel algorithm is called optimistic planning for sparsely stochastic systems. We note that the new algorithm can also be seen as model-predictive (receding-horizon) control. The algorithm obtains promising numerical results, including the successful online control of a simulated HIV infection with stochastic drug effectiveness.

Keywords :

"Planning","Markov processes","Stochastic systems","Upper bound","Computational modeling","Prediction algorithms"

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on

Print_ISBN :

978-1-4244-9887-1

Type :

conf

DOI :

10.1109/ADPRL.2011.5967375

Filename :

5967375

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3642772