Title :
A hierarchical reinforcement learning algorithm based on heuristic reward function
Author :
Yan, Qicui ; Liu, Quan ; Hu, Daojing
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Abstract :
A hierarchical reinforcement learning method based on heuristic reward function is proposed to solve the problem of “curse of dimensionality”, that is the states space will grow exponentially in the number of features, and low convergence speed. The method can reduce state spaces greatly and can enhance the speed of the study. Choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply this method to the Tetris game; the experiment result shows that the method can partly solve the “curse of dimensionality” and can enhance the convergence speed prominent.
Keywords :
computer games; convergence; learning (artificial intelligence); optimisation; Tetris game; convergence speed; dimensionality curse; heuristic reward function; hierarchical reinforcement learning algorithm; reward function optimization; Computer science; Control theory; Convergence; Function approximation; Heuristic algorithms; Learning systems; Machine learning; Space technology; State-space methods; Statistics; Tetris; curse of dimensionality; heuristic reward function; hierarchical reinforcement learning;
Conference_Titel :
Advanced Computer Control (ICACC), 2010 2nd International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-1-4244-5845-5
DOI :
10.1109/ICACC.2010.5486837