• Title of article

    Reinforcement learning for long-run average cost

  • Author/Authors

    Abhijit Gosavi, author

  • Issue Information
    Journal with serial number, year 2004
  • Pages
    21
  • From page
    654
  • To page
    674
  • Keywords
    Reinforcement learning, Stochastic processes, Two time scales
  • Journal title
    European Journal of Operational Research
  • Serial Year
    2004
  • Record number

    214926