DocumentCode :
1799320
Title :
Data-driven partially observable dynamic processes using adaptive dynamic programming
Author :
Xiangnan Zhong ; Zhen Ni ; Yufei Tang ; Haibo He
Author_Institution :
Dept. of Electr., Univ. of Rhode Island, Kingston, RI, USA
fYear :
2014
fDate :
9-12 Dec. 2014
Firstpage :
1
Lastpage :
8
Abstract :
Adaptive dynamic programming (ADP) has been widely recognized as one of the “core methodologies” to achieve optimal control for intelligent systems in Markov decision process (MDP). Generally, ADP control design requires all the information of the system dynamics. However, in many practical situations, the measured input and output data can only represent part of the system states. This means the complete information of the system cannot be available in many real-world cases, which narrows the range of application of the ADP design. In this paper, we propose a data-driven ADP method to stabilize the system with partially observable dynamics based on neural network techniques. A state network is integrated into the typical actor-critic architecture to provide an estimated state from the measured input/output sequences. The theoretical analysis and the stability discussion of this data-driven ADP method are also provided. Two examples are studied to verify our proposed method.
Keywords :
Markov processes; control system synthesis; dynamic programming; neurocontrollers; optimal control; ADP; MDP; Markov decision process; actor-critic architecture; adaptive dynamic programming; control design; core methodologies; data-driven partially observable dynamic process; intelligent systems; neural network techniques; optimal control; system dynamics; Dynamic programming; Equations; Markov processes; Neural networks; Optimal control; Performance analysis; Stability analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
Type :
conf
DOI :
10.1109/ADPRL.2014.7010628
Filename :
7010628
Link To Document :
بازگشت