مرکز منطقه ای اطلاع رساني علوم و فناوري - Bayesian Reinforcement Learning in Markovian and non-Markovian Tasks

DocumentCode :

3726539

Title :

Bayesian Reinforcement Learning in Markovian and non-Markovian Tasks

Author :

Adnane Ez-Zizi;Simon Farrell;David Leslie

Author_Institution :

Sch. of Exp. Psychol., Univ. of Bristol, Bristol, UK

fYear :

2015

Firstpage :

579

Lastpage :

586

Abstract :

We present a Bayesian reinforcement learning model with a working memory module which can solve some non-Markovian decision processes. The model is tested, and compared against SARSA (lambda), on a standard working-memory task from the psychology literature. Our method uses the Kalman temporal difference framework, And its extension to stochastic state transitions, to give posterior distributions over state-action values. This framework provides a natural mechanism for using reward information to update more than the current state-action pair, and thus negates the use of eligibility traces. Furthermore, the existence of full posterior distributions allows the use of Thompson sampling for action selection, which in turn removes the need to choose an appropriately parameterised action-selection method.

Keywords :

"Bayes methods","Kalman filters","Computational modeling","Mathematical model","Learning (artificial intelligence)","Psychology","Estimation"

Publisher :

ieee

Conference_Titel :

Computational Intelligence, 2015 IEEE Symposium Series on

Print_ISBN :

978-1-4799-7560-0

Type :

conf

DOI :

10.1109/SSCI.2015.91

Filename :

7376664

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3726539