مرکز منطقه ای اطلاع رساني علوم و فناوري - A reinforcement learning based algorithm for Markov decision processes

DocumentCode :

2361123

Title :

A reinforcement learning based algorithm for Markov decision processes

Author :

Bhatnagar, Shalabh ; Kumar, Shishir

Author_Institution :

Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India

fYear :

2005

fDate :

4-7 Jan. 2005

Firstpage :

199

Lastpage :

204

Abstract :

A variant of a recently proposed two-timescale reinforcement learning based actor-critic algorithm for infinite horizon discounted cost Markov decision processes with finite state and compact action spaces is proposed. On the faster timescale, the value function corresponding to a given stationary deterministic policy is updated and averaged while the policy itself is updated on the slower scale. The latter recursion uses the sign of the gradient estimate instead of the estimate itself. A potential advantage in the use of sign function lies in significantly reduced computation and communication overheads in applications such as congestion control in communication networks and distributed computation. Convergence analysis of the algorithm is briefly sketched and numerical experiments for a problem of congestion control are presented.

Keywords :

Markov processes; convergence; decision theory; gradient methods; learning (artificial intelligence); actor-critic algorithm; communication network congestion control; compact action space; convergence analysis; distributed computation; finite state space; gradient estimation; infinite horizon discounted cost Mark-ov decision processes; reinforcement learning algorithm; stationary deterministic policy; Algorithm design and analysis; Communication networks; Communication system control; Computer networks; Convergence of numerical methods; Costs; Distributed computing; Infinite horizon; Learning; Recursive estimation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on

Print_ISBN :

0-7803-8840-2

Type :

conf

DOI :

10.1109/ICISIP.2005.1529448

Filename :

1529448

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2361123