مرکز منطقه ای اطلاع رساني علوم و فناوري - Higher-level application of Adaptive Dynamic Programming/Reinforcement Learning - a next phase for controls and system identification?

DocumentCode :

2498890

Title :

Higher-level application of Adaptive Dynamic Programming/Reinforcement Learning - a next phase for controls and system identification?

Author :

Lendaris, George G.

Author_Institution :

Syst. Sci. Grad. Program, Portland State Univ., Portland, OR, USA

fYear :

2011

fDate :

11-15 April 2011

Abstract :

In previous work it was shown that Adaptive-Critic-type Approximate Dynamic Programming could be applied in a “higher-level” way to create autonomous agents capable of using experience to discern context and select optimal, context-dependent control policies. Early experiments with this approach were based on full a priori knowledge of the system being monitored. The experiments reported in this paper, using small neural networks representing families of mappings, were designed to explore what happens when knowledge of the system is less precise. Results of these experiments show that agents trained with this approach perform well when subject to even large amounts of noise or when employing (slightly) imperfect models. The results also suggest that aspects of this method of context discernment are consistent with our intuition about human learning. The insights gained from these explorations can be used to guide further efforts for developing this approach into a general methodology for solving arbitrary identification and control problems.

Keywords :

adaptive control; dynamic programming; identification; learning (artificial intelligence); multi-agent systems; neurocontrollers; optimal control; adaptive dynamic programming; adaptive-critic-type approximate dynamic programming; autonomous agents; context-dependent control policy; higher-level application; neural network; optimal control; reinforcement learning; system identification; Artificial neural networks; Context; Context modeling; Humans; Process control; System identification; Training; Adaptive Dynamic Programming; adaptive critic; autonomous control; context; reinforcement learning; system identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on

Conference_Location :

Paris

Print_ISBN :

978-1-4244-9887-1

Type :

conf

DOI :

10.1109/ADPRL.2011.5967395

Filename :

5967395

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2498890