Title :
Approximate Q-Learning: An Introduction
Author :
Pandey, Deepshikha ; Pandey, Punit
Author_Institution :
Dept. of Comput. Sci. & Eng., Jaypee Inst. of Eng. & Technol., Guna, India
Abstract :
This paper introduces an approach to Q-learning algorithm with rough set theory introduced by Zdzislaw Pawlak in 1981. During Q-learning, an agent makes action selections in an effort to maximize a reward signal obtained from the environment. Based on reward, agent will make changes in its policy for future actions. The problem considered in this paper is the overestimation of expected value of cumulative future discounted rewards. This discounted reward is used in evaluating agent actions and policy during reinforcement learning. Due to the overestimation of discounted reward action evaluation and policy changes are not accurate. The solution to this problem results from a form Q-learning algorithm using a combination of approximation spaces and Q-learning to estimate the expected value of returns on actions. This is made possible by considering behavior patterns of an agent in scope of approximation spaces. The framework provided by an approximation space makes it possible to measure the degree that agent behaviors are a part of (´´covered by´´) a set of accepted agent behaviors that serve as a behavior evaluation norm.
Keywords :
learning (artificial intelligence); rough set theory; Q-learning algorithm; Zdzislaw Pawlak; behavior patterns; reinforcement learning; reward action evaluation; rough set theory; Approximation algorithms; Computer science; Extraterrestrial measurements; Learning systems; Machine learning; Machine learning algorithms; Optimization methods; Paper technology; Set theory; State estimation; Approximate Q-learning Algorithm; Overestimation of Q-Values; Q-learning Algorithm; Reinforcement Learning;
Conference_Titel :
Machine Learning and Computing (ICMLC), 2010 Second International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4244-6006-9
Electronic_ISBN :
978-1-4244-6007-6
DOI :
10.1109/ICMLC.2010.38