• DocumentCode
    3113433
  • Title

    Policy Transition of Reinforcement Learning for an Agent Based SCM System

  • Author

    Zhao, Gang ; Sun, Ruoying

  • Author_Institution
    Beijing Inf. Sci. & Technol. Univ., Beijing
  • fYear
    2006
  • fDate
    16-18 Aug. 2006
  • Firstpage
    793
  • Lastpage
    798
  • Abstract
    Reinforcement learning (RL) is successfully applied to some dynamical and unpredictable domains. The Supply Chain Management (SCM) is NP-hard problem. Some proposed RL methods perform better than traditional tools for dynamic problem solving in SCM. It realizes on-line learning and performs efficiently in some applications, but RL agent reacts worse than some heuristic methods to sudden changes in SCM demand since the trial-and-error characteristic of RL is time-consuming in practice. By surveying an efficient policy transition mechanism in RL about how to mapping existing policies in the previous task to a new policies in a changed task, this paper proposes a novel RL agent based SCM system that decreases learning time of the RL agent to a dynamic environment. As the result, the RL agent derives the maximal profit using RL technique as jobs coming with a stable distribution. Further, the RL agent makes the optimal procurement satisfying the requirement of sudden changes in the supply chain network by the policy transition mechanism.
  • Keywords
    learning (artificial intelligence); optimisation; software agents; supply chain management; NP-hard problem; SCM; SCM system; dynamic problem solving; policy transition; reinforcement learning; supply chain management; trial-and-error characteristic; Chaos; Humans; Information science; Learning; NP-hard problem; Problem-solving; Scheduling; Sun; Supply chain management; Supply chains;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Informatics, 2006 IEEE International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    0-7803-9700-2
  • Electronic_ISBN
    0-7803-9701-0
  • Type

    conf

  • DOI
    10.1109/INDIN.2006.275663
  • Filename
    4053490