مرکز منطقه ای اطلاع رساني علوم و فناوري - Data-driven learning and control with multiple critic networks

DocumentCode :

577600

Title :

Data-driven learning and control with multiple critic networks

Author :

He, Haibo ; Ni, Zhen ; Zhao, Dongbin

Author_Institution :

Dept. of Electr., Comput. & Biomed. Eng., Univ. of Rhode Island, Kingston, RI, USA

fYear :

2012

fDate :

6-8 July 2012

Firstpage :

523

Lastpage :

527

Abstract :

In this paper, we extend our previous work of a three-network adaptive dynamic programming design [1] to be a multiple critic networks design for online learning and control. The key idea of this approach is to develop a hierarchical internal goal representation to facilitate the online learning with detailed and informative internal value signal representations. We present our learning architecture in detail, and also demonstrate its performance on the popular cart-pole balancing benchmark. Simulation results demonstrate the effectiveness of our approach. We also present discussions of further research directions along this topic.

Keywords :

control engineering computing; dynamic programming; learning (artificial intelligence); cart-pole balancing benchmark; data-driven learning; hierarchical internal goal representation; informative internal value signal representation; learning architecture; multiple critic networks design; online learning; three-network adaptive dynamic programming design; Adaptive systems; Benchmark testing; Computer architecture; Dynamic programming; Helium; Nickel; Vectors; adaptive dynamic programming (ADP); external reinforcement signal; goal representation; hierarchical structure; internal reinforcement signal; multiple critic networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control and Automation (WCICA), 2012 10th World Congress on

Conference_Location :

Beijing

Print_ISBN :

978-1-4673-1397-1

Type :

conf

DOI :

10.1109/WCICA.2012.6357935

Filename :

6357935

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=577600