DocumentCode :
1799323
Title :
A data-based online reinforcement learning algorithm with high-efficient exploration
Author :
Yuanheng Zhu ; Dongbin Zhao
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Instn. of Autom., Beijing, China
fYear :
2014
fDate :
9-12 Dec. 2014
Firstpage :
1
Lastpage :
6
Abstract :
An online reinforcement learning algorithm is proposed in this paper to directly utilizes online data efficiently for continuous deterministic systems without system parameters. The dependence on some specific approximation structures is crucial to limit the wide application of online reinforcement learning algorithms. We utilize the online data directly with the kd-tree technique to remove this limitation. Moreover, we design the algorithm in the Probably Approximately Correct principle. Two examples are simulated to verify its good performance.
Keywords :
data handling; learning (artificial intelligence); tree data structures; approximation structures; continuous deterministic systems; data-based online reinforcement learning algorithm; high-efficient exploration; kd-tree technique; probably approximately correct principle; Approximation algorithms; Approximation methods; DC motors; Learning (artificial intelligence); Optimal control; Partitioning algorithms; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
Type :
conf
DOI :
10.1109/ADPRL.2014.7010631
Filename :
7010631
Link To Document :
بازگشت