مرکز منطقه ای اطلاع رساني علوم و فناوري - An optimal learning algorithm for S-model environments

DocumentCode :

813072

Title :

An optimal learning algorithm for S-model environments

Author :

Mason, L.G.

Author_Institution :

University of Saskatchewan, Saskatoon, Canada

Volume :

Issue :

fYear :

1973

fDate :

10/1/1973 12:00:00 AM

Firstpage :

493

Lastpage :

496

Abstract :

A class of stochastic automata models is proposed for the synthesis of a parameter optimizing controller. The automaton can operate in environments characterized by reward strengths ( $S$ -models) or reward probabilities ( $P$ -models). In the $P$ -model case the proposed algorithm is equivalent to the ε-optimal algorithm reported by Shapiro and Narendra. The algorithm discussed here was originally reported by Mason with emphasis on the $P$ -model case. In this paper, emphasis is placed on the $S$ -model case. Recently, an equivalent ε-optimal algorithm has been reported by Viswanathan and Narendra. It is Shown herein that only the optimal solution is stable and that the expected performance converges monotonically. Simulation results are presented that corroborate the analytical results. It is demonstrated that the proposed algorithm is superior tO McLaren\´s linear reinforcement scheme in regard to expediency.

Keywords :

Learning control systems; Stochastic automata; Adaptive control; Automatic control; Control systems; Electrons; Environmental economics; Learning automata; Linear systems; Optimal control; Programmable control; Stochastic processes;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/TAC.1973.1100406

Filename :

1100406

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=813072