مرکز منطقه ای اطلاع رساني علوم و فناوري - Kernel-based reinforcement learning for traffic signal control with adaptive feature selection

DocumentCode :

114574

Title :

Kernel-based reinforcement learning for traffic signal control with adaptive feature selection

Author :

Tianshu Chu ; Jie Wang ; Jian Cao

Author_Institution :

Dept. of Civil & Environ. Eng., Stanford Univ., Stanford, CA, USA

fYear :

2014

fDate :

15-17 Dec. 2014

Firstpage :

1277

Lastpage :

1282

Abstract :

Reinforcement learning in a large-scale system is computationally challenging due to the curse of the dimensionality. One approach is to approximate the Q-function as a function of a state-action related feature vector, then learn the parameters instead. Although assumptions from the priori knowledge can potentially explore an appropriate feature vector, selecting a biased one that insufficiently represents the system usually leads to the poor learning performance. To avoid this disadvantage, this paper introduces kernel methods to implicitly propose a learnable feature vector instead of a pre-selected one. More specifically, the feature vector is estimated from a reference set which contains all critical state-action pairs observed so far, and it can be updated by either adding a new pair or replace an existing one in the reference set. Thus the approximate Q-function keeps adjusting itself as the knowledge about the system accumulates via observations. Our algorithm is designed in both batch mode and online mode in the context of the traffic signal control. In addition, the convergence of this algorithm is experimentally supported. Furthermore, some regularization methods are proposed to avoid overfitting of Q-function on the noisy observations. Finally, A simulation on the traffic signal control in a single intersection is provided, and the performance of this algorithm is compared with Q-learning, in which the Q-function is numerically estimated for each state-action pair without approximation.

Keywords :

feature selection; large-scale systems; learning systems; traffic control; vectors; Q-function; adaptive feature selection; batch mode; critical state-action pairs; kernel-based reinforcement learning; large-scale system; noisy observations; online mode; regularization methods; state-action related feature vector; traffic signal control; Algorithm design and analysis; Approximation algorithms; Approximation methods; Convergence; Joints; Kernel; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control (CDC), 2014 IEEE 53rd Annual Conference on

Conference_Location :

Los Angeles, CA

Print_ISBN :

978-1-4799-7746-8

Type :

conf

DOI :

10.1109/CDC.2014.7039557

Filename :

7039557

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=114574