DocumentCode :
2585606
Title :
Control by Gradient Collocation: Applications to optimal obstacle avoidance and minimum torque control
Author :
Ruvolo, Paul ; Wu, Tingfan ; Movellan, Javier R.
Author_Institution :
Machine Perception Lab., Univ. of California, San Diego, La Jolla, CA, USA
fYear :
2012
fDate :
7-12 Oct. 2012
Firstpage :
1173
Lastpage :
1179
Abstract :
We present a new machine learning algorithm for learning optimal feedback control policies to guide a robot to a goal in the presence of obstacles. Our method works by first reducing the problem of obstacle avoidance to a continuous state, action, and time control problem, and then uses efficient collocation methods to solve for an optimal feedback control policy. This formulation of the obstacle avoidance problem improves over standard approaches, such as potential field methods, by being resistant to local minima, allowing for moving obstacles, handling stochastic systems, and computing feedback control strategies that take into account the robot´s (possibly non-linear) dynamics. In addition to contributing a new method for obstacle avoidance, our work contributes to the state-of-the-art in collocation methods for non-linear stochastic optimal control problems in two important ways: (1) we show that taking into account local gradient and second-order derivative information of the optimal value function at the collocation points allows us to exploit knowledge of the derivative information about the system dynamics, and (2) we show that computational savings can be achieved by directly fitting the gradient of the optimal value function rather than the optimal value function itself. We validate our approach on three problems: non-convex obstacle avoidance of a point-mass robot, obstacle avoidance for a 2 degree of freedom robotic manipulator, and optimal control of a non-linear dynamical system.
Keywords :
collision avoidance; feedback; gradient methods; learning (artificial intelligence); manipulators; nonlinear dynamical systems; optimal control; torque control; continuous state; gradient collocation; machine learning algorithm; minimum torque control; nonlinear dynamical system; optimal feedback control policies; optimal obstacle avoidance; optimal value function itself; point-mass robot; potential field methods; robotic manipulator; second-order derivative information; time control problem; Collision avoidance; Equations; Least squares approximation; Mathematical model; Optimal control; Robot kinematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on
Conference_Location :
Vilamoura
ISSN :
2153-0858
Print_ISBN :
978-1-4673-1737-5
Type :
conf
DOI :
10.1109/IROS.2012.6385556
Filename :
6385556
Link To Document :
بازگشت