Title :
Policy gradient fuzzy reinforcement learning
Author :
Wang, Xue-ning ; Xu, Xin ; He, Han-gen
Author_Institution :
Inst. of Autom., Nat. Univ. of Defence Technol., Changsha, China
Abstract :
This work presents a new approach for tuning conclusions of fuzzy rules based on reinforcement learning. Unlike the most of existing fuzzy reinforcement learning algorithms, which are based on value function, while our approach called policy gradient fuzzy reinforcement learning (PGFRL) bases on gradient estimate. In PGFRL, the algorithm GPOMDP is employed to estimate the performance gradient with respect to the parameters of fuzzy rules. In our work we prove the convergence of fuzzy rules´ parameters to a local optimum given necessary conditions. The experiment results show the effectiveness of PGFRL.
Keywords :
fuzzy control; gradient methods; learning (artificial intelligence); fuzzy control; fuzzy rules; gradient estimate; policy gradient fuzzy reinforcement learning; Computational modeling; Control systems; Convergence; Equations; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Helium; Learning; State estimation;
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1382332