DocumentCode
813072
Title
An optimal learning algorithm for S-model environments
Author
Mason, L.G.
Author_Institution
University of Saskatchewan, Saskatoon, Canada
Volume
18
Issue
5
fYear
1973
fDate
10/1/1973 12:00:00 AM
Firstpage
493
Lastpage
496
Abstract
A class of stochastic automata models is proposed for the synthesis of a parameter optimizing controller. The automaton can operate in environments characterized by reward strengths (
-models) or reward probabilities (
-models). In the
-model case the proposed algorithm is equivalent to the ε-optimal algorithm reported by Shapiro and Narendra. The algorithm discussed here was originally reported by Mason with emphasis on the
-model case. In this paper, emphasis is placed on the
-model case. Recently, an equivalent ε-optimal algorithm has been reported by Viswanathan and Narendra. It is Shown herein that only the optimal solution is stable and that the expected performance converges monotonically. Simulation results are presented that corroborate the analytical results. It is demonstrated that the proposed algorithm is superior tO McLaren\´s linear reinforcement scheme in regard to expediency.
-models) or reward probabilities (
-models). In the
-model case the proposed algorithm is equivalent to the ε-optimal algorithm reported by Shapiro and Narendra. The algorithm discussed here was originally reported by Mason with emphasis on the
-model case. In this paper, emphasis is placed on the
-model case. Recently, an equivalent ε-optimal algorithm has been reported by Viswanathan and Narendra. It is Shown herein that only the optimal solution is stable and that the expected performance converges monotonically. Simulation results are presented that corroborate the analytical results. It is demonstrated that the proposed algorithm is superior tO McLaren\´s linear reinforcement scheme in regard to expediency.Keywords
Learning control systems; Stochastic automata; Adaptive control; Automatic control; Control systems; Electrons; Environmental economics; Learning automata; Linear systems; Optimal control; Programmable control; Stochastic processes;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.1973.1100406
Filename
1100406
Link To Document