DocumentCode :
2548986
Title :
An improved immune Q-learning algorithm
Author :
Ji, Zhengqiao ; Wu, Q. M Jonathan ; Sid-Ahmed, Maher
Author_Institution :
Univ. of Windsor, Windsor
fYear :
2007
fDate :
7-10 Oct. 2007
Firstpage :
1636
Lastpage :
1641
Abstract :
Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance between exploration and exploitation is one of the key problems of action selection in reinforcement learning. Exploitation causes the agent to reach a locally optimal policy quickly, whereas excessive exploration degrades the performance of the algorithm, though it may improve the learning performance and escape from a locally optimal policy. Recently the human immune systems have aroused researcher´s interest due to its useful mechanisms which can be exploited for information processing in a complex cognition system. In this paper, we transplant some immune mechanisms into the basic Q-learning algorithm and convert Q-learning algorithm into a search for the optimum solution in combinatorial optimization. Experiments show that the improved Q-learning converges more quickly than Q-learning or Boltzmann exploration, and easily obtains the global solution set.
Keywords :
combinatorial mathematics; learning (artificial intelligence); optimisation; Boltzmann exploration; combinatorial optimization; complex cognition system; human immune systems; immune Q-learning algorithm; locally optimal policy; reinforcement learning; Autonomous agents; Cognition; Degradation; Delay; Humans; Immune system; Information processing; Learning automata; Learning systems; Navigation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
978-1-4244-0990-7
Electronic_ISBN :
978-1-4244-0991-4
Type :
conf
DOI :
10.1109/ICSMC.2007.4414135
Filename :
4414135
Link To Document :
بازگشت