• DocumentCode
    1300638
  • Title

    Pattern-recognizing stochastic learning automata

  • Author

    Barto, Andrew G. ; Anandan, P.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Massachusetts Univ., Amherst, MA, USA
  • Issue
    3
  • fYear
    1985
  • Firstpage
    360
  • Lastpage
    375
  • Abstract
    A class of learning tasks is described that combines aspects of learning automation tasks and supervised learning pattern-classification tasks. These tasks are called associative reinforcement learning tasks. An algorithm is presented, called the associative reward-penalty, or AR-P algorithm for which a form of optimal performance is proved. This algorithm simultaneously generalizes a class of stochastic learning automata and a class of supervised learning pattern-classification methods related to the Robbins-Monro stochastic approximation procedure. The relevance of this hybrid algorithm is discussed with respect to the collective behaviour of learning automata and the behaviour of networks of pattern-classifying adaptive elements. Simulation results are presented that illustrate the associative reinforcement learning task and the performance of the AR-P algorithm as compared with that of several existing algorithms.
  • Keywords
    learning systems; pattern recognition; stochastic automata; Robbins-Monro stochastic approximation; associative reward-penalty; pattern recognition; pattern-classification; stochastic learning automata; supervised learning; Approximation algorithms; Classification algorithms; Learning; Learning automata; Supervised learning; Vectors;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9472
  • Type

    jour

  • DOI
    10.1109/TSMC.1985.6313371
  • Filename
    6313371