Policy gradient fuzzy reinforcement learning

Author

Wang, Xue-ning ; Xu, Xin ; He, Han-gen

Author_Institution

Inst. of Autom., Nat. Univ. of Defence Technol., Changsha, China

Volume

fYear

2004

fDate

26-29 Aug. 2004

Firstpage

992

Abstract

This work presents a new approach for tuning conclusions of fuzzy rules based on reinforcement learning. Unlike the most of existing fuzzy reinforcement learning algorithms, which are based on value function, while our approach called policy gradient fuzzy reinforcement learning (PGFRL) bases on gradient estimate. In PGFRL, the algorithm GPOMDP is employed to estimate the performance gradient with respect to the parameters of fuzzy rules. In our work we prove the convergence of fuzzy rules´ parameters to a local optimum given necessary conditions. The experiment results show the effectiveness of PGFRL.

Keywords

fuzzy control; gradient methods; learning (artificial intelligence); fuzzy control; fuzzy rules; gradient estimate; policy gradient fuzzy reinforcement learning; Computational modeling; Control systems; Convergence; Equations; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Helium; Learning; State estimation;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on

Print_ISBN

0-7803-8403-2

Type

conf

DOI

10.1109/ICMLC.2004.1382332

Filename

1382332

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=424298