DocumentCode :
619826
Title :
Technique analysis and designing of program with UCT algorithm for NoGo
Author :
Rui Li ; Yueqiu Wu ; Zhang, Angela ; Chen Ma ; Bo Chen ; Shuliang Wang
Author_Institution :
Sch. of Software, Beijing Inst. of Technol., Beijing, China
fYear :
2013
fDate :
25-27 May 2013
Firstpage :
923
Lastpage :
928
Abstract :
As a typical example of dynamic search algorithm, the UCT algorithm was initially used on the computerized game of GO. This paper briefly introduces the Markov Decision process, the Multi-armed Bandit model, and the Upper-Confidence Bandit formula. It analyzes the source and structure of the UCT algorithm in theory, and proves that the UCT algorithm is suitable for the design of the program of NoGo. According to the characteristics of NoGo, in the paper we improved the algorithm in terms of move generation and data reuse. We also tried to establish an off-line knowledge database for research. With experimental data we have tested and evaluated the above methods. The above algorithm and technology have been successfully used in WTShadows-the NoGo game program, which enabled us to have won the champion in national competition.
Keywords :
Markov processes; computer games; database management systems; knowledge based systems; search problems; Markov decision process; NoGo game; UCT algorithm; data reuse; dynamic search algorithm; move generation; multiarmed Bandit model; offline knowledge database; program design; upper-confidence Bandit formula; Algorithm design and analysis; Games; Heuristic algorithms; Knowledge based systems; Markov processes; Mathematical model; Runtime; Dynamic Move Queue; Knowledge Base; MAB Model; Markov Decision Process; NoGo; UCT Algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control and Decision Conference (CCDC), 2013 25th Chinese
Conference_Location :
Guiyang
Print_ISBN :
978-1-4673-5533-9
Type :
conf
DOI :
10.1109/CCDC.2013.6561055
Filename :
6561055
Link To Document :
بازگشت