Reinforcement Learning with Inertial Exploration

Author

Bergeron, Dany ; Desjardins, Charles ; Laumônier, Julien ; Chaib-Draa, Brahim

Author_Institution

Laval Univ., Laval

fYear

2007

fDate

2-5 Nov. 2007

Firstpage

277

Lastpage

280

Abstract

In the Q-learning framework, the exploration of large environment is influenced by the time credit assignment problem. In this context, abstraction techniques may be used. Thus, multi-step actions (MSA) Q-learning has been proposed to take advantage of the fact that few action switches are usually required in optimal policies. In this article, we propose the concept of inertial exploration, we apply a log-selection of the scales to MSA Q-learning and we go further by proposing a dynamic time scale approach. We demonstrate that the same improvement in learning speed can be achieved without the full scales set. This improvement is shown on the mountain car problem and on a more realistic application of vehicle control.

Keywords

learning (artificial intelligence); abstraction technique; dynamic time scale approach; inertial exploration; log-selection; mountain car problem; multistep actions Q-learning; reinforcement learning; time credit assignment problem; vehicle control; Computer science; Intelligent agent; Machine learning; Machine learning algorithms; Optimal control; Software engineering; Switches; Vehicle dynamics; Vehicle safety; Vehicles;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Agent Technology, 2007. IAT '07. IEEE/WIC/ACM International Conference on

Conference_Location

Fremont, CA

Print_ISBN

978-0-7695-3027-7

Type

conf

DOI

10.1109/IAT.2007.74

Filename

4407297