Title of article :
Reinforcement learning with internal expectation in the random neural networks for cascaded decisions
Author/Authors :
Halici, Ugur
Issue Information :
Journal with serial number, year 2001
From page :
21
To page :
0
Abstract :
The reinforcement learning scheme proposed in Halici (J. Biosystems 40 (1997) 83) for the random neural network (RNN) (Neural Computation 1 (1989) 502) is based on reward and performs well in stationary environments. However, when the environment is not stationary, the scheme gets stuck on the previously learned action, and extinction is not possible. To overcome this problem, the reinforcement scheme was extended in Halici (Eur. J. Oper. Res. 126 (2000) 288) by introducing a new weight update rule (E-rule) that takes into account the internal expectation of reinforcement. Although the E-rule was proposed for the RNN, it can also be used to train learning automata or other intelligent systems based on reinforcement learning. This paper examines the behavior of the learning scheme with internal expectation in environments where the reinforcement is obtained after a sequence of cascaded decisions. The simulation results show that the RNN learns well and that extinction is possible even in cases with several decision steps and hundreds of possible decision paths.
Keywords :
Physics and evolution, Symbols and codes, Howard Pattee
Journal title :
BioSystems
Serial Year :
2001
Record number :
47789