مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem

DocumentCode :

3576073

Title :

Approximate dynamic programming based on Gaussian process regression for the perimeter patrol optimization problem

Author :

Naiming Qi ; Xiaolei Sun ; Kang Sun ; Xingfu Liu ; Feng Wu ; Chao Liu

Author_Institution :

Sch. of Astronaut., Harbin Inst. of Technol., Harbin, China

fYear :

2014

Firstpage :

1750

Lastpage :

1754

Abstract :

A methodology is presented in this paper for stochastic optimal control of unmanned aerial vehicle performing the task of perimeter patrol. The optimal control problem is modeled as a Markov decision processes, and an approximate policy iteration algorithm is used for the cost-to-go function (value function) by introducing Gaussian process regression, resulting in improved quality of the decisions made while retaining computationally feasibility. The approximate dynamic programming (ADP) framework is developed to tackle the issues, in which situations standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, the Gaussian processes that provide the combination of the prior and noise models presents a sub-solution in a lower dimensional space by exploiting kernel-based method. The numerical results that corroborate the effectiveness of the proposed methodology are also provided.

Keywords :

Gaussian processes; Markov processes; autonomous aerial vehicles; dynamic programming; iterative methods; optimal control; regression analysis; stochastic systems; ADP framework; Gaussian process regression; Markov decision process; approximate dynamic programming framework; approximate policy iteration algorithm; cost-to-go function; kernel-based method; lower dimensional space; nonparametric ADP algorithm; optimal control problem; perimeter patrol optimization problem; standard dynamic programming algorithm; stochastic optimal control; unmanned aerial vehicle; value function; Approximation algorithms; Approximation methods; Computational modeling; Dynamic programming; Gaussian processes; Ground penetrating radar; Heuristic algorithms; Gaussian process regression; Markov decision processes; approximate dynamic programming (ADP); perimeter patrol; unmanned air vehicle;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Mechatronics and Control (ICMC), 2014 International Conference on

Print_ISBN :

978-1-4799-2537-7

Type :

conf

DOI :

10.1109/ICMC.2014.7231861

Filename :

7231861

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3576073