DocumentCode
428593
Title
An ant system based exploration-exploitation for reinforcement learning
Author
Chang, Hyeong Soo
Author_Institution
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Volume
4
fYear
2004
fDate
10-13 Oct. 2004
Firstpage
3805
Abstract
In this paper, we develop a novel exploration-exploitation strategy for reinforcement learning based on ant colony system. Most of the exploration-exploitation strategies use some statistics extracted from a single simulated trajectory. The novel strategy uses some statistics extracted from multiple simulated trajectories obtained from a swarm of ants. We show that the strategy preserves the convergence property of Q-learning.
Keywords
combinatorial mathematics; learning (artificial intelligence); optimisation; Q-learning; ant colony system; convergence property; exploration-exploitation strategy; reinforcement learning; single simulated trajectory; Ant colony optimization; Biological information theory; Biological system modeling; Birds; Casting; Computational modeling; Computer science; Learning; Marine animals; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2004 IEEE International Conference on
ISSN
1062-922X
Print_ISBN
0-7803-8566-7
Type
conf
DOI
10.1109/ICSMC.2004.1400937
Filename
1400937
Link To Document