• DocumentCode
    3525384
  • Title

    Markov decision process toolbox

  • Author

    Shan Peng ; Li Ran ; Ning Sheng-hua ; Yang Qin

  • Author_Institution
    Dept. Electron. Inf. & Control Eng., Guangxi Univ. of Technol., Liuzhou, China
  • fYear
    2009
  • fDate
    18-20 Sept. 2009
  • Firstpage
    123
  • Lastpage
    128
  • Abstract
    Markov decision process is optimal policy-making process which is based on the Markov process theory of random dynamical systems. It is also a theoretical tool to study optimization problems about multi-stage policy-making process in random environment. For its wide range of applications, developing the Markov decision process toolbox is of great significance for the scientific computing software SCILAB. Markov policy process consists of three main criterions: the expected total reward criterion, discount criterion and average criterion. Finally, taking the toys manufacturers as the example the effectiveness of the method is tested.
  • Keywords
    Markov processes; decision theory; mathematics computing; natural sciences computing; Markov decision process toolbox; SCILAB; average criterion; discount criterion; expected total reward criterion; multistage policy-making process; optimal policy-making process; random dynamical system; scientific computing software; toys manufacturers; Application software; Control systems; Dynamic programming; Macroeconomics; Markov processes; Mathematical model; Scientific computing; Software tools; Stochastic systems; Toy manufacturing industry; Markov decision process; SCILAB; average criterion; discount criterion; expectation total reward criterion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Open-source Software for Scientific Computation (OSSC), 2009 IEEE International Workshop on
  • Conference_Location
    Guiyang
  • Print_ISBN
    978-1-4244-4452-6
  • Electronic_ISBN
    978-1-4244-4453-3
  • Type

    conf

  • DOI
    10.1109/OSSC.2009.5416859
  • Filename
    5416859