Title of article
Reinforcement learning for long-run average cost
Author/Authors
Abhijit Gosavi، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2004
Pages
21
From page
654
To page
674
Keywords
reinforcement learning , Stochastic processes , Two time scales
Journal title
European Journal of Operational Research
Serial Year
2004
Journal title
European Journal of Operational Research
Record number
214926
Link To Document