DocumentCode :
249978
Title :
DE Based Q-Learning Algorithm to Improve Speed of Convergence in Large Search Space Applications
Author :
Rahaman, Zenefa ; Sil, J.
Author_Institution :
Dept. of Comput. Sci. & Technol., Bengal Eng. & Sci. Univ., Howrah, India
fYear :
2014
fDate :
9-11 Jan. 2014
Firstpage :
408
Lastpage :
412
Abstract :
The main drawback of reinforcement learning is that it learns nothing from an episode until it is over. So the learning procedure is very slow in case of large space applications. Differential Evolution (DE) algorithm is a population-based evolutionary optimization algorithm able to learn the search space in iterative way. In the paper, improvement of Q-learning method has been proposed using DE algorithm where guided randomness has been incorporated in the search space resulting fast convergence. Markov Decision Process (MDP), a mathematical framework has been used to model the problem in order to learn the large search space efficiently. The proposed algorithm exhibits better result in terms of speed and performance compare to basic Q-learning algorithm.
Keywords :
Markov processes; evolutionary computation; learning (artificial intelligence); search problems; DE based Q-learning algorithm; MDP; Markov decision process; differential evolution algorithm; large search space applications; population-based evolutionary optimization algorithm; reinforcement learning; search space; Convergence; Heuristic algorithms; Optimization; Signal processing algorithms; Sociology; Statistics; Vectors; Convergence; Differential Evolution algorithm; Markov Decision Process; Q-Learning algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronic Systems, Signal Processing and Computing Technologies (ICESC), 2014 International Conference on
Conference_Location :
Nagpur
Type :
conf
DOI :
10.1109/ICESC.2014.80
Filename :
6745413
Link To Document :
بازگشت