• DocumentCode
    404240
  • Title

    Learning, optimizing, and distributed decision making based on experience

  • Author

    Ho, Yu-Chi

  • Author_Institution
    Harvard Univ., Cambridge, MA, USA
  • Volume
    5
  • fYear
    2003
  • fDate
    9-12 Dec. 2003
  • Firstpage
    4818
  • Abstract
    We present a short and simplified "derivation" and discussion of perturbation analysis (PA), Markov decision problems (MDP), and reinforcement learning (RL) based on the sample path approach.
  • Keywords
    Markov processes; distributed decision making; learning (artificial intelligence); optimisation; perturbation techniques; Markov decision problems; distributed decision making; experience; optimization; perturbation analysis; reinforcement learning; sample path method; Contracts; Convergence; Costs; Distributed decision making; Dynamic programming; History; Learning; Stability analysis; State-space methods; Steady-state;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-7924-1
  • Type

    conf

  • DOI
    10.1109/CDC.2003.1272352
  • Filename
    1272352