DocumentCode :
3184448
Title :
Learning in n-pursuer n-evader differential games
Author :
Desouky, Sameh F. ; Schwartz, Howard M.
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, ON, Canada
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
4069
Lastpage :
4074
Abstract :
This paper discusses learning in n-purser n-evader games. In a pursuit-evasion game, one player (the pursuer) pursues another one while the other (the evader) tries to escape. We assume that each player only knows the instantaneous position of the other players but at the same time none of them knows its control strategy nor the control strategy of the other players. Therefore, the players have to self-learn their control strategies on-line by interaction with each other. In this paper, we extend our previous work from learning in a single pursuit-evasion game to learning in a multi-pursuit-evasion game. We use the Q(λ)-learning based genetic fuzzy controller technique (QLBGFC) proposed in. The proposed technique combines reinforcement learning with both a fuzzy controller and genetic algorithms in a two-phase structure. In addition to the proposed QLBGFC, we construct a new Q-table that is responsible for learning the coupling process between the pursuers and the evaders. To test the performance of the proposed technique, it is compared with the optimal strategy of a single pursuit-evasion game. Computer simulations show the usefulness of the proposed technique.
Keywords :
differential games; fuzzy control; genetic algorithms; learning (artificial intelligence); Q(λ)-learning based genetic fuzzy controller technique; Q-table; n-pursuer n-evader differential games; reinforcement learning; Couplings; Differential game; Q(λ)-learning; fuzzy control; genetic algorithms; multi-robot; pursuit-evasion game; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5642186
Filename :
5642186
Link To Document :
بازگشت