DocumentCode :
2817893
Title :
Epsilon-optimal discretized pursuit learning automata
Author :
Oommen, B.J. ; Lanctot, Joseph K.
Author_Institution :
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
fYear :
1989
fDate :
14-17 Nov 1989
Firstpage :
6
Abstract :
The authors consider the problem of a stochastic learning automaton interacting with an unknown random environment. The fundamental problem is that of learning, through interaction, the best action (that is, the action which is rewarded optimally) allowed by the environment. By using running estimates of reward probabilities to learn the optimal action, an extremely efficient pursuit algorithm was obtained by M.A.L. Thathachar et al. (1986, 1989) which is presently among the fastest-growing algorithms known. In the present work, the authors investigate the improvements gained by rendering the pursuit algorithm discrete. This is done by restricting the probability of selecting an action to a finite and, hence, discrete subset of [0,1]. This improved scheme is proven to be ε-optimal in all stationary environments. Furthermore, the authors´ experimental results seem to indicate that the algorithm is the fastest-absorbing learning automaton reported in the literature to date. Comparison with the continuous form of the pursuit algorithm is also presented
Keywords :
learning systems; stochastic automata; epsilon-optimal discretised pursuit learning automata; reward probabilities; unknown random environment; Artificial intelligence; Biological system modeling; Computer science; Databases; Humans; Learning automata; Pattern recognition; Pursuit algorithms; Stochastic processes; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 1989. Conference Proceedings., IEEE International Conference on
Conference_Location :
Cambridge, MA
Type :
conf
DOI :
10.1109/ICSMC.1989.71244
Filename :
71244
Link To Document :
بازگشت