مرکز منطقه ای اطلاع رساني علوم و فناوري - Reinforcement learning by backpropagation through an LSTM model/critic

DocumentCode :

2717346

Title :

Reinforcement learning by backpropagation through an LSTM model/critic

Author :

Bakker, Bram

Author_Institution :

Intelligent Syst. Lab. Amsterdam, Amsterdam Univ.

fYear :

2007

fDate :

1-5 April 2007

Firstpage :

127

Lastpage :

134

Abstract :

This paper describes backpropagation through an LSTM recurrent neural network model/critic, for reinforcement learning tasks in partially observable domains. This combines the advantage of LSTM´s strength at learning long-term temporal dependencies to infer states in partially observable tasks, with the advantage of being able to learn high-dimensional and/or continuous actions with backpropagation´s focused credit assignment mechanism

Keywords :

backpropagation; recurrent neural nets; backpropagation; credit assignment; recurrent neural network; reinforcement learning; Backpropagation; Dynamic programming; Intelligent networks; Intelligent systems; Laboratories; Learning systems; Neural networks; Observability; Recurrent neural networks; State-space methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location :

Honolulu, HI

Print_ISBN :

1-4244-0706-0

Type :

conf

DOI :

10.1109/ADPRL.2007.368179

Filename :

4220824

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717346