DocumentCode :
1300638
Title :
Pattern-recognizing stochastic learning automata
Author :
Barto, Andrew G. ; Anandan, P.
Author_Institution :
Dept. of Comput. & Inf. Sci., Massachusetts Univ., Amherst, MA, USA
Issue :
3
fYear :
1985
Firstpage :
360
Lastpage :
375
Abstract :
A class of learning tasks is described that combines aspects of learning automation tasks and supervised learning pattern-classification tasks. These tasks are called associative reinforcement learning tasks. An algorithm is presented, called the associative reward-penalty, or AR-P algorithm for which a form of optimal performance is proved. This algorithm simultaneously generalizes a class of stochastic learning automata and a class of supervised learning pattern-classification methods related to the Robbins-Monro stochastic approximation procedure. The relevance of this hybrid algorithm is discussed with respect to the collective behaviour of learning automata and the behaviour of networks of pattern-classifying adaptive elements. Simulation results are presented that illustrate the associative reinforcement learning task and the performance of the AR-P algorithm as compared with that of several existing algorithms.
Keywords :
learning systems; pattern recognition; stochastic automata; Robbins-Monro stochastic approximation; associative reward-penalty; pattern recognition; pattern-classification; stochastic learning automata; supervised learning; Approximation algorithms; Classification algorithms; Learning; Learning automata; Supervised learning; Vectors;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/TSMC.1985.6313371
Filename :
6313371
Link To Document :
بازگشت