Title of article :
New algorithms of the Q-learning type
Author/Authors :
Shalabh Bhatnagar، نويسنده , , K. Mohan Babu، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2008
Pages :
9
From page :
1111
To page :
1119
Keywords :
Markov decision processes , reinforcement learning , SPSA , Q-learning , Two-timescale stochastic approximation
Journal title :
Automatica
Serial Year :
2008
Journal title :
Automatica
Record number :
370983
Link To Document :
بازگشت