DocumentCode :
3121831
Title :
Self-modifying reinforcement learning
Author :
Zhao, Jie-Yu
Author_Institution :
IST Res., Ningbo Univ., Zhejiang, China
Volume :
4
fYear :
2002
fDate :
4-5 Nov. 2002
Firstpage :
2146
Abstract :
We describe several experiments with reinforcement learning systems based on the technique of incremental self-improvement (IS). IS uses the success-story algorithm (SSA) to undo unrewarding policy changes computed by self-modifying policies. The experiment demonstrates IS´ advantages over stochastic hill climbing and TD Q-learning in noisy environments given limited computational resources.
Keywords :
learning (artificial intelligence); learning automata; stochastic automata; TD Q-learning; incremental self-improvement; noisy environments; self-modifying reinforcement learning; stochastic hill climbing; success-story algorithm; Acceleration; Genetic algorithms; Learning; Monitoring; Noise measurement; Performance evaluation; Stochastic processes; Testing; Time measurement; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
Type :
conf
DOI :
10.1109/ICMLC.2002.1175418
Filename :
1175418
Link To Document :
بازگشت