Title :
Coarse planning for landmark navigation in a neural-network reinforcement-learning robot
Author :
Baldassarre, Gianluca
Author_Institution :
Dept. of Comput. Sci., Essex Univ., Colchester, UK
Abstract :
Is it possible to plan at a coarse level and act at a fine level with a neural-network (NN) reinforcement-learning (RL) planner? This work presents a NN planner, used to control a simulated robot in a stochastic landmark-navigation problem, which plans at an abstract level. The controller has both reactive components, based on actor-critic RL, and planning components inspired by the Dyna-PI architecture (this roughly corresponds to RL plus a model of the environment). Coarse planning is based on macro-actions defined as a sequence of identical primitive actions. It updates the evaluations and the action policy while generating simulated experience at the macro level with the model of the environment (a NN trained at the macro level). The simulations show how the controller works. They also show the advantages of using a discount coefficient tuned to the level of planning coarseness, and suggest that discounted RL has problems in dealing with long periods of time
Keywords :
computerised navigation; digital simulation; learning (artificial intelligence); mobile robots; neurocontrollers; path planning; Dyna-PI architecture; actor-critic RL; coarse planning; discount coefficient; landmark navigation; neural-network reinforcement-learning robot; planning components; stochastic landmark-navigation problem; Artificial intelligence; Computational modeling; Computer science; Costs; Intelligent robots; Navigation; Neural networks; Robot control; Robot sensing systems; Stochastic processes;
Conference_Titel :
Intelligent Robots and Systems, 2001. Proceedings. 2001 IEEE/RSJ International Conference on
Conference_Location :
Maui, HI
Print_ISBN :
0-7803-6612-3
DOI :
10.1109/IROS.2001.976428