مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

883950

Title :

Discretized pursuit learning automata

Author :

Oommen, B. John ; Lanctôt, J. Kevin

Author_Institution :

Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada

Volume :

Issue :

fYear :

1990

Firstpage :

931

Lastpage :

938

Abstract :

The problem of a stochastic learning automaton interacting with an unknown random environment is considered. The fundamental problem is that of learning, through interaction, the best action allowed by the environment (i.e. the action that is rewarded optimally). By using running estimates of reward probabilities to learn the optimal action, an extremely efficient pursuit algorithm (PA), which is presently among the fastest algorithms known, was reported in earlier works. The improvements gained by rendering the PA discrete are investigated. This is done by restricting the probability of selecting an action to a finite and, hence, discrete subset of [0, 1]. This improved scheme is proven to be ε-optimal in all stationary environments. Furthermore, the experimental results seem to indicate that the algorithm presented is faster than the fastest nonestimator learning automata reported to date, and also faster than the continuous pursuit automaton

Keywords :

learning systems; probability; stochastic automata; discrete subset; pursuit algorithm; reward probabilities; stochastic learning automaton; Aerospace control; Damping; Delay effects; Humans; Learning automata; Manipulator dynamics; Neuromuscular; Pursuit algorithms; Vehicle driving; Vehicle dynamics;

fLanguage :

English

Journal_Title :

Systems, Man and Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9472

Type :

jour

DOI :

10.1109/21.105092

Filename :

105092

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=883950