مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2770389

Title :

Reward hierarchical temporal memory

Author :

Choi, Hansol ; Park, Jun-Cheol ; Lim, Jae Hyun ; Jun, Jae Young ; Kim, Dae-Shik

Author_Institution :

Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea

fYear :

2012

fDate :

10-15 June 2012

Firstpage :

Lastpage :

Abstract :

In humans and animals, reward prediction error encoded by dopamine systems is thought to be important in the temporal difference learning class of reinforcement learning (RL). With RL algorithms, many brain models have described the function of dopamine and related areas, including the basal ganglia and frontal cortex. In spite of this importance, how the reward prediction error itself is computed is not understood well, including the problem of how the current states are assigned to a memorized states and how the values of the states are memorized. In this paper, we describe a neocortical model for memorizing state space and computing reward prediction error, known as `reward hierarchical temporal memory´ (rHTM). In this model, the temporal relationships among events are hierarchically stored. Using this memory, rHTM computes reward prediction errors by associating the memorized sequences to rewards and inhibits the predicted reward. In a simulation, our model behaved similarly to dopaminergic neurons. We suggest that our model can provide a hypothetical framework of interaction between cortex and dopamine neurons.

Keywords :

brain models; learning (artificial intelligence); neural nets; RL algorithms; basal ganglia; brain models; cortex-dopamine neuron interaction; dopamine systems; dopaminergic neurons; frontal cortex; neocortical model; rHTM; reinforcement learning; reward hierarchical temporal memory; reward prediction error computation; reward prediction error memorization; state space memorization; temporal difference learning class; Animals; Brain modeling; Computational modeling; Instruments; Neurons; Prediction algorithms; Predictive models; HTM; rHTM; reinforcement learning; reward; reward prediction error; reward-HTM; temporal difference;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2012 International Joint Conference on

Conference_Location :

Brisbane, QLD

ISSN :

2161-4393

Print_ISBN :

978-1-4673-1488-6

Electronic_ISBN :

2161-4393

Type :

conf

DOI :

10.1109/IJCNN.2012.6252433

Filename :

6252433

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2770389