Title :
Reinforcement learning in a behaviour-based control architecture for marine archaeology
Author :
Gordon Frost;Francesco Maurelli;David M Lane
Author_Institution :
Ocean Systems Laboratory, School of Engineering &
fDate :
5/1/2015 12:00:00 AM
Abstract :
We present a novel path planner for adaptive behaviour of an Autonomous Underwater Vehicle (AUV). A behaviour-based architecture forms the foundation of the system with an extra layer which uses experience to learn a policy for modulating the behaviours´ weights. In effect, this creates an abstract environment for the Reinforement Learning (RL) agent´s state and action space. Subsequently, it simplifies the problem the RL agent is addressing, creating a more stable system. The Episodic Natural Actor Critic (ENAC) RL algorithm is used due to the continuous input and output domains and for the natural actor critic´s convergence properties. Adaptiveness of the system is presented in a thruster failure scenario. RL is used in this failure scenario to learn an appropriate policy for the behaviours´ weights under the new vehicle dynamics. We apply this control architecture to the domain of marine archaeology which has an inherent problem of navigation in unknown, potentially complex and dangerous environments. Simulated results of the proposed control architecture demonstrate its feasibility and performance.
Keywords :
"Vehicles","Modulation","Surges","Learning (artificial intelligence)","Machine learning algorithms","Robots","Adaptive systems"
Conference_Titel :
OCEANS 2015 - Genova
DOI :
10.1109/OCEANS-Genova.2015.7271619