An ant system based exploration-exploitation for reinforcement learning

Author

Chang, Hyeong Soo

Author_Institution

Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea

Volume

fYear

2004

fDate

10-13 Oct. 2004

Firstpage

3805

Abstract

In this paper, we develop a novel exploration-exploitation strategy for reinforcement learning based on ant colony system. Most of the exploration-exploitation strategies use some statistics extracted from a single simulated trajectory. The novel strategy uses some statistics extracted from multiple simulated trajectories obtained from a swarm of ants. We show that the strategy preserves the convergence property of Q-learning.

Keywords

combinatorial mathematics; learning (artificial intelligence); optimisation; Q-learning; ant colony system; convergence property; exploration-exploitation strategy; reinforcement learning; single simulated trajectory; Ant colony optimization; Biological information theory; Biological system modeling; Birds; Casting; Computational modeling; Computer science; Learning; Marine animals; Statistics;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics, 2004 IEEE International Conference on

ISSN

1062-922X

Print_ISBN

0-7803-8566-7

Type

conf

DOI

10.1109/ICSMC.2004.1400937

Filename

1400937

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=428593