Title of article :
Reinforcement learning for long-run average cost
Author/Authors :
Abhijit Gosavi، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Keywords :
reinforcement learning , Stochastic processes , Two time scales
Journal title :
European Journal of Operational Research
Journal title :
European Journal of Operational Research