Title :
Theoretical analysis of a reinforcement learning based switching scheme
Author_Institution :
Mech. Eng. Dept., South Dakota Sch. of Mines & Technol., Rapid City, SD, USA
Abstract :
A reinforcement learning based scheme for optimal switching with an infinite-horizon cost function is briefly proposed in this paper. Several theoretical questions are shown to arise regarding its convergence, optimality of the result, and continuity of the limit function, to be uniformly approximated using parametric function approximators. The main contribution of the paper is providing rigorous answers for the questions, where, sufficient conditions for convergence, optimality, and continuity are provided.
Keywords :
function approximation; learning (artificial intelligence); infinite-horizon cost function; optimal switching; parametric function approximators; reinforcement learning based switching scheme; Approximation methods; Artificial neural networks; Convergence; Cost function; Optimal control; Schedules; Switches;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010614