• DocumentCode
    264847
  • Title

    Simulated Annealing Monte Carlo Tree Search for Large POMDPs

  • Author

    Kai Xiong ; Hong Jiang

  • Author_Institution
    Southwest Univ. of Sci. & Technol., Mianyang, China
  • Volume
    1
  • fYear
    2014
  • fDate
    26-27 Aug. 2014
  • Firstpage
    140
  • Lastpage
    143
  • Abstract
    Many planning and control problems can be modeled as large POMDPs, but very few can be solved scalably because of their computational complexity. This paper proposes a Simulated Annealing based on the Monte Carlo Tree Search for large POMDPs. The proposed algorithm determines an acceptance probability of sampling a back-propagation´s outcome in the simulated annealing process. The experiments show that the proposed SAMCTS (Simulated Annealing Monte Carlo Tree Search) outperforms the original Simulated Annealing algorithm when applied to a large POMDP benchmark problem.
  • Keywords
    Markov processes; Monte Carlo methods; backpropagation; computational complexity; multi-agent systems; probability; simulated annealing; tree searching; POMDP benchmark problem; acceptance probability; agents; back-propagation outcome; computational complexity; partially observable Markov decision process; simulated annealing Monte Carlo tree search; Games; Markov processes; Monte Carlo methods; Planning; Rocks; Search problems; Simulated annealing; MCTS; POMDPs; Simulated Annealing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2014 Sixth International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4799-4956-4
  • Type

    conf

  • DOI
    10.1109/IHMSC.2014.42
  • Filename
    6917325