DocumentCode :
3576073
Title :
Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem
Author :
Naiming Qi ; Xiaolei Sun ; Kang Sun ; Xingfu Liu ; Feng Wu ; Chao Liu
Author_Institution :
Sch. of Astronaut., Harbin Inst. of Technol., Harbin, China
fYear :
2014
Firstpage :
1750
Lastpage :
1754
Abstract :
A methodology is presented in this paper for stochastic optimal control of unmanned aerial vehicle performing the task of perimeter patrol. The optimal control problem is modeled as a Markov decision processes, and an approximate policy iteration algorithm is used for the cost-to-go function (value function) by introducing Gaussian process regression, resulting in improved quality of the decisions made while retaining computationally feasibility. The approximate dynamic programming (ADP) framework is developed to tackle the issues, in which situations standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, the Gaussian processes that provide the combination of the prior and noise models presents a sub-solution in a lower dimensional space by exploiting kernel-based method. The numerical results that corroborate the effectiveness of the proposed methodology are also provided.
Keywords :
Gaussian processes; Markov processes; autonomous aerial vehicles; dynamic programming; iterative methods; optimal control; regression analysis; stochastic systems; ADP framework; Gaussian process regression; Markov decision process; approximate dynamic programming framework; approximate policy iteration algorithm; cost-to-go function; kernel-based method; lower dimensional space; nonparametric ADP algorithm; optimal control problem; perimeter patrol optimization problem; standard dynamic programming algorithm; stochastic optimal control; unmanned aerial vehicle; value function; Approximation algorithms; Approximation methods; Computational modeling; Dynamic programming; Gaussian processes; Ground penetrating radar; Heuristic algorithms; Gaussian process regression; Markov decision processes; approximate dynamic programming (ADP); perimeter patrol; unmanned air vehicle;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronics and Control (ICMC), 2014 International Conference on
Print_ISBN :
978-1-4799-2537-7
Type :
conf
DOI :
10.1109/ICMC.2014.7231861
Filename :
7231861
Link To Document :
بازگشت