Title :
On using distribution theory to prove the epsilon-optimality of stubborn learning mechanisms
Author :
Christensen, J.P.R. ; Oommen, B.J.
Author_Institution :
Copenhagen Telephone Co., Kobenhavn, Denmark
Abstract :
The authors consider the problem of a learning mechanism learning the optimal action offered by a random environment. The mechanism presented can be defined as an action probability updating rule and thus as a variable-structure stochastic automaton. The machine is essentially a stubborn machine; in other words, once the machine has chosen a particular action, it increases the probability of choosing that action irrespective of whether the response from the environment was favorable or unfavorable. However, this increase in the action probability is done in a systematic and methodical way so that the machine learns, in an ε-optimal fashion, the best action which the environment offers. The proposed mechanism forms an excellent model for an ε-optimal stubbornly learning system. Apart from the fact that the machine is shown to be ε-optimal, a major contribution of the present work is that the mathematical tools used in this proof (namely the theory of distributions, kernels, and topological spaces) are quite distinct from those which are currently used in the field of learning. Also presented are simulation results which demonstrate the properties of the mechanism and which compare it to the traditional LRI scheme.
Keywords :
automata theory; learning systems; probability; ε-optimal stubbornly learning system; LRI scheme; action probability updating rule; distribution theory; random environment; variable-structure stochastic automaton; Biological system modeling; Learning automata; Learning systems; Machine learning; Mathematical model; Mechanical factors; Organisms; Psychology; Stochastic processes; Telephony
Conference_Title :
Systems, Man and Cybernetics, 1989. Conference Proceedings., IEEE International Conference on
Conference_Location :
Cambridge, MA
DOI :
10.1109/ICSMC.1989.71299