DocumentCode
2302086
Title
Fuzzy Rule Interpolation-based Q-learning
Author
Vincze, Dávid ; Kovács, Szilveszter
Author_Institution
Dept. of Inf. Technol., Univ. of Miskolc, Miskolc, Hungary
fYear
2009
fDate
28-29 May 2009
Firstpage
55
Lastpage
60
Abstract
Reinforcement learning is a well known topic in computational intelligence. It can be used to solve control problems in unknown environments without defining an exact method on how to solve problems in various situations. Instead the goal is defined and all the actions done in the different states are given feedback, called reward or punishment (positive or negative reward). Based on these rewards the system can learn which action is considered the best in a given state. A method called Q-learning can be used for building up the state-action-value function. This method uses discrete states. With the application of fuzzy reasoning the method can be extended to be used in continuous environment, called Fuzzy Q-learning (FQ-Learning). Traditional Fuzzy Q-learning uses 0-order Takagi-Sugeno fuzzy inference. The main goal of this paper is to introduce Fuzzy Rule Interpolation (FRI), namely the FIVE (Fuzzy rule Interpolation based on Vague Environment) to be the model applied with Q-learning (FRIQ-learning). The paper also includes an application example: the well known cart pole (reversed pendulum) problem is used for demonstrating the applicability of the FIVE model in Q-learning.
Keywords
fuzzy reasoning; fuzzy set theory; interpolation; learning (artificial intelligence); Takagi-Sugeno fuzzy inference; fuzzy Q-learning method; fuzzy reasoning; fuzzy rule interpolation; reinforcement learning; state-action-value function; Computational intelligence; Distributed computing; Fuzzy reasoning; Informatics; Information technology; Interpolation; Learning; Negative feedback; State feedback; Takagi-Sugeno model;
fLanguage
English
Publisher
ieee
Conference_Titel
Applied Computational Intelligence and Informatics, 2009. SACI '09. 5th International Symposium on
Conference_Location
Timisoara
Print_ISBN
978-1-4244-4477-9
Electronic_ISBN
978-1-4244-4478-6
Type
conf
DOI
10.1109/SACI.2009.5136311
Filename
5136311
Link To Document