DocumentCode
249978
Title
DE Based Q-Learning Algorithm to Improve Speed of Convergence in Large Search Space Applications
Author
Rahaman, Zenefa ; Sil, J.
Author_Institution
Dept. of Comput. Sci. & Technol., Bengal Eng. & Sci. Univ., Howrah, India
fYear
2014
fDate
9-11 Jan. 2014
Firstpage
408
Lastpage
412
Abstract
The main drawback of reinforcement learning is that it learns nothing from an episode until it is over. So the learning procedure is very slow in case of large space applications. Differential Evolution (DE) algorithm is a population-based evolutionary optimization algorithm able to learn the search space in iterative way. In the paper, improvement of Q-learning method has been proposed using DE algorithm where guided randomness has been incorporated in the search space resulting fast convergence. Markov Decision Process (MDP), a mathematical framework has been used to model the problem in order to learn the large search space efficiently. The proposed algorithm exhibits better result in terms of speed and performance compare to basic Q-learning algorithm.
Keywords
Markov processes; evolutionary computation; learning (artificial intelligence); search problems; DE based Q-learning algorithm; MDP; Markov decision process; differential evolution algorithm; large search space applications; population-based evolutionary optimization algorithm; reinforcement learning; search space; Convergence; Heuristic algorithms; Optimization; Signal processing algorithms; Sociology; Statistics; Vectors; Convergence; Differential Evolution algorithm; Markov Decision Process; Q-Learning algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronic Systems, Signal Processing and Computing Technologies (ICESC), 2014 International Conference on
Conference_Location
Nagpur
Type
conf
DOI
10.1109/ICESC.2014.80
Filename
6745413
Link To Document