• DocumentCode
    813072
  • Title

    An optimal learning algorithm for S-model environments

  • Author

    Mason, L.G.

  • Author_Institution
    University of Saskatchewan, Saskatoon, Canada
  • Volume
    18
  • Issue
    5
  • fYear
    1973
  • fDate
    10/1/1973 12:00:00 AM
  • Firstpage
    493
  • Lastpage
    496
  • Abstract
    A class of stochastic automata models is proposed for the synthesis of a parameter optimizing controller. The automaton can operate in environments characterized by reward strengths ( S -models) or reward probabilities ( P -models). In the P -model case the proposed algorithm is equivalent to the ε-optimal algorithm reported by Shapiro and Narendra. The algorithm discussed here was originally reported by Mason with emphasis on the P -model case. In this paper, emphasis is placed on the S -model case. Recently, an equivalent ε-optimal algorithm has been reported by Viswanathan and Narendra. It is Shown herein that only the optimal solution is stable and that the expected performance converges monotonically. Simulation results are presented that corroborate the analytical results. It is demonstrated that the proposed algorithm is superior tO McLaren\´s linear reinforcement scheme in regard to expediency.
  • Keywords
    Learning control systems; Stochastic automata; Adaptive control; Automatic control; Control systems; Electrons; Environmental economics; Learning automata; Linear systems; Optimal control; Programmable control; Stochastic processes;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.1973.1100406
  • Filename
    1100406