DocumentCode :
2770389
Title :
Reward hierarchical temporal memory
Author :
Choi, Hansol ; Park, Jun-Cheol ; Lim, Jae Hyun ; Jun, Jae Young ; Kim, Dae-Shik
Author_Institution :
Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
1
Lastpage :
7
Abstract :
In humans and animals, reward prediction error encoded by dopamine systems is thought to be important in the temporal difference learning class of reinforcement learning (RL). With RL algorithms, many brain models have described the function of dopamine and related areas, including the basal ganglia and frontal cortex. In spite of this importance, how the reward prediction error itself is computed is not understood well, including the problem of how the current states are assigned to a memorized states and how the values of the states are memorized. In this paper, we describe a neocortical model for memorizing state space and computing reward prediction error, known as `reward hierarchical temporal memory´ (rHTM). In this model, the temporal relationships among events are hierarchically stored. Using this memory, rHTM computes reward prediction errors by associating the memorized sequences to rewards and inhibits the predicted reward. In a simulation, our model behaved similarly to dopaminergic neurons. We suggest that our model can provide a hypothetical framework of interaction between cortex and dopamine neurons.
Keywords :
brain models; learning (artificial intelligence); neural nets; RL algorithms; basal ganglia; brain models; cortex-dopamine neuron interaction; dopamine systems; dopaminergic neurons; frontal cortex; neocortical model; rHTM; reinforcement learning; reward hierarchical temporal memory; reward prediction error computation; reward prediction error memorization; state space memorization; temporal difference learning class; Animals; Brain modeling; Computational modeling; Instruments; Neurons; Prediction algorithms; Predictive models; HTM; rHTM; reinforcement learning; reward; reward prediction error; reward-HTM; temporal difference;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
ISSN :
2161-4393
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
Type :
conf
DOI :
10.1109/IJCNN.2012.6252433
Filename :
6252433
Link To Document :
بازگشت