Title :
Relative reward strength algorithms for learning automata
Author :
Simha, Rahul; Kurose, James F.
Author_Institution :
Dept. of Comput. & Inf. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
A novel class of action probability update algorithms for learning automata that use the relative reward strengths of responses from the environment is examined. Specifically, update algorithms are considered for S-model automata in which the 'recent' environmental responses retained for each action are used. A convergence result is proven and the behavior of these automata is studied by simulation. A major result is that the performance of these algorithms is superior, in several respects, to that of the well-known SL_{R-I} update algorithm. Additional results are presented on the variability of performance, the cost of learning and, in the case of static environments, modifications that result in improved convergence.
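The sketch below is a hedged illustration of the general idea described in the abstract, not the authors' exact update rule: an S-model automaton retains the most recent reward observed for each action and moves its action probability vector toward the actions with the largest relative (normalized) recent rewards. The class name RelativeRewardAutomaton, the step_size parameter, and the environment's reward means are all hypothetical choices made for the example.

```python
# Hedged sketch (assumed form, not the paper's exact algorithm): an S-model
# learning automaton that keeps the most recent reward seen for each action
# and updates action probabilities using relative reward strengths.

import random


class RelativeRewardAutomaton:
    def __init__(self, n_actions, step_size=0.05):
        self.n = n_actions
        self.step = step_size                      # learning-rate parameter (assumed)
        self.p = [1.0 / n_actions] * n_actions     # action probability vector
        self.recent = [0.0] * n_actions            # last reward retained per action

    def choose_action(self):
        # Sample an action according to the current probability vector.
        r, acc = random.random(), 0.0
        for i, pi in enumerate(self.p):
            acc += pi
            if r <= acc:
                return i
        return self.n - 1

    def update(self, action, reward):
        # S-model: the environmental response is a real number in [0, 1].
        self.recent[action] = reward
        total = sum(self.recent)
        if total == 0.0:
            return
        # Relative reward strength of each action (normalized retained rewards).
        target = [s / total for s in self.recent]
        # Move the probability vector a small step toward that distribution.
        self.p = [(1 - self.step) * pi + self.step * ti
                  for pi, ti in zip(self.p, target)]


# Usage: a stationary S-model environment with hypothetical mean rewards.
if __name__ == "__main__":
    means = [0.2, 0.8, 0.5]
    automaton = RelativeRewardAutomaton(len(means))
    for _ in range(5000):
        a = automaton.choose_action()
        reward = min(1.0, max(0.0, random.gauss(means[a], 0.1)))
        automaton.update(a, reward)
    print("learned probabilities:", [round(x, 3) for x in automaton.p])
```

In a stationary environment such as the one above, the probability mass should concentrate on the action with the highest mean reward; the paper additionally studies convergence, variability of performance, and the cost of learning for this class of algorithms.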
Keywords :
automata theory; probability; S-model automata; action probability update algorithms; convergence; learning automata; relative reward strengths; Algorithm design and analysis; Convergence; Costs; History; Information science; Learning automata; Probability distribution; Stochastic processes;
Journal_Title :
IEEE Transactions on Systems, Man, and Cybernetics