Title :
Reinforcement learning by backpropagation through an LSTM model/critic
Author_Institution :
Intelligent Syst. Lab. Amsterdam, Amsterdam Univ.
Abstract :
This paper describes backpropagation through an LSTM recurrent neural network model/critic, for reinforcement learning tasks in partially observable domains. This combines the advantage of LSTM´s strength at learning long-term temporal dependencies to infer states in partially observable tasks, with the advantage of being able to learn high-dimensional and/or continuous actions with backpropagation´s focused credit assignment mechanism
Keywords :
backpropagation; recurrent neural nets; backpropagation; credit assignment; recurrent neural network; reinforcement learning; Backpropagation; Dynamic programming; Intelligent networks; Intelligent systems; Laboratories; Learning systems; Neural networks; Observability; Recurrent neural networks; State-space methods;
Conference_Titel :
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0706-0
DOI :
10.1109/ADPRL.2007.368179