DocumentCode :
424298
Title :
Policy gradient fuzzy reinforcement learning
Author :
Wang, Xue-ning ; Xu, Xin ; He, Han-gen
Author_Institution :
Inst. of Autom., Nat. Univ. of Defence Technol., Changsha, China
Volume :
2
fYear :
2004
fDate :
26-29 Aug. 2004
Firstpage :
992
Abstract :
This work presents a new approach for tuning conclusions of fuzzy rules based on reinforcement learning. Unlike the most of existing fuzzy reinforcement learning algorithms, which are based on value function, while our approach called policy gradient fuzzy reinforcement learning (PGFRL) bases on gradient estimate. In PGFRL, the algorithm GPOMDP is employed to estimate the performance gradient with respect to the parameters of fuzzy rules. In our work we prove the convergence of fuzzy rules´ parameters to a local optimum given necessary conditions. The experiment results show the effectiveness of PGFRL.
Keywords :
fuzzy control; gradient methods; learning (artificial intelligence); fuzzy control; fuzzy rules; gradient estimate; policy gradient fuzzy reinforcement learning; Computational modeling; Control systems; Convergence; Equations; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Helium; Learning; State estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
Type :
conf
DOI :
10.1109/ICMLC.2004.1382332
Filename :
1382332
Link To Document :
بازگشت