• DocumentCode
    353248
  • Title

    Dynamic programming with ARMA, Markov, and NARMA models vs. Q-learning-case study

  • Author

    Chrobak, J. ; Pacut, A. ; Karbowski, A.

  • Author_Institution
    Warsaw Univ. of Technol., Poland
  • Volume
    3
  • fYear
    2000
  • fDate
    27-27 July 2000
  • Firstpage
    265
  • Abstract
    Two approaches to control policy synthesis for unknown systems are investigated. The indirect approach is based on the identification of ARMA, NARMA, or Markov chain models, and applications of dynamic programming to these models with or without the use of a certainty equivalence principle. The direct approach is represented by Q-learning, with the lookup table or with the use of radial basis function approximation. We implemented both methods to optimization of a stock portfolio and tested on the Warsaw stock exchange data.
  • Keywords
    autoregressive moving average processes; dynamic programming; function approximation; hidden Markov models; investment; learning (artificial intelligence); probability; radial basis function networks; stock markets; table lookup; ARMA model; Markov chain models; NARMA models; Q-learning; dynamic programming; function approximation; lookup table; optimization; portfolio; probability; radial basis function neural networks; stock market; Computer aided software engineering; Control system synthesis; Dynamic programming; Function approximation; Investments; Optimization methods; Share prices; Stock markets; Table lookup; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
  • Conference_Location
    Como, Italy
  • ISSN
    1098-7576
  • Print_ISBN
    0-7695-0619-4
  • Type

    conf

  • DOI
    10.1109/IJCNN.2000.861314
  • Filename
    861314