DocumentCode :
714070
Title :
Handling stochastic reward delays in machine reinforcement learning
Author :
Campbell, Jeffrey S. ; Givigi, Sidney N. ; Schwartz, Howard M.
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, ON, Canada
fYear :
2015
fDate :
3-6 May 2015
Firstpage :
314
Lastpage :
319
Abstract :
The main contribution of this work is a novel learning algorithm for machine reinforcement learning when Poissonian stochastic time delays are present in the reinforcement signal. The novel approach can deal with rewards which may be received out of order in time or overlap with one another. A PID controller is simulated with and without a stochastic time delay to demonstrate the difficulties of the problem. Experimental results with mobile robots demonstrate that the proposed method improves the performance over that of traditional Q-learning for a learning agent in an environment with Poissonian-type stochastically delayed rewards.
Keywords :
delays; learning (artificial intelligence); mobile robots; stochastic systems; three-term control; PID controller; Poissonian stochastic time delays; learning agent; machine reinforcement learning; mobile robots; reinforcement signal; stochastic reward delays; traditional Q-learning; Delay effects; Delays; Learning (artificial intelligence); Mobile robots; Robot sensing systems; Stochastic processes; Markov Decision Process; Reinforcement learning; cost; jitter; reward; stochastic time delay;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
Conference_Location :
Halifax, NS
ISSN :
0840-7789
Print_ISBN :
978-1-4799-5827-6
Type :
conf
DOI :
10.1109/CCECE.2015.7129295
Filename :
7129295
Link To Document :
بازگشت