• DocumentCode
    791705
  • Title

    An algorithm for learning without external supervision and its application to learning control systems

  • Author

    Nikolic, Z.J. ; Fu, K.S.

  • Author_Institution
    Purdue University, Lafayette, IN, USA
  • Volume
    11
  • Issue
    3
  • fYear
    1966
  • fDate
    7/1/1966 12:00:00 AM
  • Firstpage
    414
  • Lastpage
    422
  • Abstract
    An algorithm is proposed for the design of "on-line" learning controllers to control a discrete stochastic plant. The subjective probabilities of applying control actions from a finite set of allowable actions using random strategy, after any plant-environment situation (called an "event") is observed, are modified through the algorithm. The subjective probability for the optimal action is proved to approach one with probability one for any observed event. The optimized performance index is the conditional expectation of the instantaneous performance evaluations with respect to the observed events and the allowable actions. The algorithm is described through two transformations, T1and T2. After the "ordering transformation" T1is applied on the estimates of the performance indexes of the allowable actions, the "learning transformation" T2modifies the subjective probabilities. The cases of discrete and continuous features are considered. In the latter, the Potential Function Method is employed. The algorithm is compared with a linear reinforcement scheme and computer simulation results are presented.
  • Keywords
    Learning control systems; Linear systems, stochastic discrete-time; Algorithm design and analysis; Automatic control; Computer simulation; Control systems; Optimal control; Performance analysis; Random variables; Stochastic processes; Student members; Uncertainty;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.1966.1098345
  • Filename
    1098345