DocumentCode :
813072
Title :
An optimal learning algorithm for S-model environments
Author :
Mason, L.G.
Author_Institution :
University of Saskatchewan, Saskatoon, Canada
Volume :
18
Issue :
5
fYear :
1973
fDate :
10/1/1973 12:00:00 AM
Firstpage :
493
Lastpage :
496
Abstract :
A class of stochastic automata models is proposed for the synthesis of a parameter optimizing controller. The automaton can operate in environments characterized by reward strengths ( S -models) or reward probabilities ( P -models). In the P -model case the proposed algorithm is equivalent to the ε-optimal algorithm reported by Shapiro and Narendra. The algorithm discussed here was originally reported by Mason with emphasis on the P -model case. In this paper, emphasis is placed on the S -model case. Recently, an equivalent ε-optimal algorithm has been reported by Viswanathan and Narendra. It is Shown herein that only the optimal solution is stable and that the expected performance converges monotonically. Simulation results are presented that corroborate the analytical results. It is demonstrated that the proposed algorithm is superior tO McLaren\´s linear reinforcement scheme in regard to expediency.
Keywords :
Learning control systems; Stochastic automata; Adaptive control; Automatic control; Control systems; Electrons; Environmental economics; Learning automata; Linear systems; Optimal control; Programmable control; Stochastic processes;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.1973.1100406
Filename :
1100406
Link To Document :
بازگشت