• Title of article

    Stochastic Dynamic Production Control by Neurodynamic Programming

  • Author/Authors

    Monostori، نويسنده , , L. and Csلji، نويسنده , , B.Cs.، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2006
  • Pages
    6
  • From page
    473
  • To page
    478
  • Abstract
    The paper proposes Markov Decision Processes (MDPs) to model production control systems that work in uncertain and changing environments. In an MDP finding an optimal control policy can be traced back to computing the optimal value function, which is the unique solution of the Bellman equation. Reinforcement learning methods, such as Q-learning, can be used for estimating this function; however, the value estimations are often only available for a few states of the environment, typically generated by simulation. The paper suggests the application of a new type of support vector regression model, called ?-SVR, which can effectively fit a smooth function to the available data and allow good generalization properties. The effectiveness of the approach is shown by experimental results on both benchmark and industry related data.
  • Keywords
    Machine Learning , production control , Neurodynamic programming
  • Journal title
    CIRP Annals - Manufacturing Technology
  • Serial Year
    2006
  • Journal title
    CIRP Annals - Manufacturing Technology
  • Record number

    2267548