DocumentCode :
1709576
Title :
Potential based policy gradient approach for optimal control of the stochastic system with unknown noise
Author :
Cheng Kang ; Zhang Kanjian ; Fei Shumin ; Wei Haikun
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
fYear :
2013
Firstpage :
2362
Lastpage :
2366
Abstract :
This paper considers optimal control problem of the discrete-time stochastic system, where the state space is continuous and the probability property of stochastic noise is unknown. First, the considered optimal control problem is transformed into a Markov Decision Process. Then, the performance potential based performance derivative formula can be applied for estimating the performance derivative with respect to the control parameters, which is the key of the policy gradient approach of this paper. For estimating the state transition probability density function (PDF) and the potential function, the RBF neural network is applied. With kn-Nearest Neighbor techniques, the sample pairs for training the RBF neural networks can be collected from a sample path, so that the policy gradient approach can be implemented on-line for practical application. The simulation shows the effectiveness of the proposed approach.
Keywords :
Markov processes; continuous systems; decision theory; discrete time systems; gradient methods; learning systems; neurocontrollers; noise; optimal control; performance index; probability; radial basis function networks; state-space methods; stochastic systems; Markov decision process; PDF; RBF neural network training; continuous state space; control parameters; discrete-time stochastic system; kn-nearest neighbor technique; optimal control problem; performance derivative estimation; performance potential based performance derivative formula; potential based policy gradient approach; probability property; sample pairs; state transition probability density function; unknown stochastic noise; Function approximation; Markov processes; Neural networks; Noise; Optimal control; Optimization; Markov Decision Processes; Optimal Control; Performance Potential; Policy Gradient; RBF Neural Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2013 32nd Chinese
Conference_Location :
Xi´an
Type :
conf
Filename :
6639821
Link To Document :
بازگشت