Title :
DE Based Q-Learning Algorithm to Improve Speed of Convergence in Large Search Space Applications
Author :
Rahaman, Zenefa ; Sil, J.
Author_Institution :
Dept. of Comput. Sci. & Technol., Bengal Eng. & Sci. Univ., Howrah, India
Abstract :
The main drawback of reinforcement learning is that it learns nothing from an episode until it is over. So the learning procedure is very slow in case of large space applications. Differential Evolution (DE) algorithm is a population-based evolutionary optimization algorithm able to learn the search space in iterative way. In the paper, improvement of Q-learning method has been proposed using DE algorithm where guided randomness has been incorporated in the search space resulting fast convergence. Markov Decision Process (MDP), a mathematical framework has been used to model the problem in order to learn the large search space efficiently. The proposed algorithm exhibits better result in terms of speed and performance compare to basic Q-learning algorithm.
Keywords :
Markov processes; evolutionary computation; learning (artificial intelligence); search problems; DE based Q-learning algorithm; MDP; Markov decision process; differential evolution algorithm; large search space applications; population-based evolutionary optimization algorithm; reinforcement learning; search space; Convergence; Heuristic algorithms; Optimization; Signal processing algorithms; Sociology; Statistics; Vectors; Convergence; Differential Evolution algorithm; Markov Decision Process; Q-Learning algorithm;
Conference_Titel :
Electronic Systems, Signal Processing and Computing Technologies (ICESC), 2014 International Conference on
Conference_Location :
Nagpur
DOI :
10.1109/ICESC.2014.80