• DocumentCode
    2571887
  • Title

    The improvement of Q-learning applied to imperfect information game

  • Author

    Lin, Jing ; Wang, Xuan ; Han, Lijiao ; Zhang, Jiajia ; Xu, Xinxin

  • Author_Institution
    Intell. Comput. Res. Center, HIT Shenzhen, Shenzhen, China
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    1562
  • Lastpage
    1567
  • Abstract
    There exist problems of slow convergence and local optimum in standard Q-learning algorithm. Truncated TD estimate returns efficiency and simulated annealing algorithm increase the chance of exploration. To accelerate the algorithm convergence speed and to avoid results in local optimum, this paper combines Q-learning algorithm, truncated TD estimation and simulated annealing algorithm. We apply improved Q-learning algorithm using into the imperfect information game (SiGuo military chess game), and realize a self-learning of imperfect information game system. Experimental outcomes show that this system can dynamically adjust each weight which describes game state according to the results. Further, it speeds up the process of learning, effectively simulates human intelligence and makes reasonable step, and significantly improves system performance.
  • Keywords
    estimation theory; game theory; learning (artificial intelligence); simulated annealing; Q-learning algorithm; SiGuo military chess game; algorithm convergence speed; human intelligence; imperfect information game; self-learning; simulated annealing algorithm; truncated TD estimate returns efficiency; Acceleration; Computational modeling; Conference management; Convergence; Cybernetics; Humans; Military computing; Simulated annealing; Technology management; USA Councils; Q-learning; imperfect information game; simulated annealing; truncated TD;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5346316
  • Filename
    5346316