Title of article :
Reinforcement learning for long-run average cost
Author/Authors :
Abhijit Gosavi (author)
Issue Information :
Journal issue with serial number, year 2004
Pages :
21
From page :
654
To page :
674
Keywords :
Reinforcement learning, Stochastic processes, Two time scales
Journal title :
European Journal of Operational Research
Serial Year :
2004
Record number :
214926