Title :
Multi-objective reinforcement learning for AUV thruster failure recovery
Author :
Ahmadzadeh, Seyed Reza ; Kormushev, Petar ; Caldwell, D.G.
Author_Institution :
Dept. of Adv. Robot., Ist. Italiano di Tecnol., Genoa, Italy
Abstract :
This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy search that learns on an on-board simulated model of the vehicle. When a fault is detected and isolated the model of the AUV is reconfigured according to the new condition. To discover a set of optimal solutions a multi-objective reinforcement learning approach is employed which can deal with multiple conflicting objectives. Each optimal solution can be used to generate a trajectory that is able to navigate the AUV towards a specified target while satisfying multiple objectives. The discovered policies are executed on the robot in a closed-loop using AUV´s state feedback. Unlike most existing methods which disregard the faulty thruster, our approach can also deal with partially broken thrusters to increase the persistent autonomy of the AUV. In addition, the proposed approach is applicable when the AUV either becomes under-actuated or remains redundant in the presence of a fault. We validate the proposed approach on the model of the Girona500 AUV.
Keywords :
autonomous underwater vehicles; closed loop systems; control engineering computing; fault diagnosis; learning (artificial intelligence); mobile robots; optimal control; state feedback; AUV state feedback; AUV thruster failure recovery; Girona500 AUV; autonomous underwater vehicles; closed-loop; conflicting objective; fault detection; fault-tolerant control policy; faulty thruster; model-based direct policy search; multiobjective reinforcement learning approach; on-board simulated model; optimal solution; Optimization; Sociology; Statistics; Trajectory; Vectors; Vehicle dynamics; Vehicles;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010621