• DocumentCode
    186287
  • Title

    Extending cortical-basal inspired reinforcement learning model with success-failure experience

  • Author

    Debnath, Shoubhik ; Nassour, John

  • Author_Institution
    Inst. for Cognitive Syst., Tech. Univ. Munich, Munich, Germany
  • fYear
    2014
  • fDate
    13-16 Oct. 2014
  • Firstpage
    293
  • Lastpage
    298
  • Abstract
    Neurocognitive studies showed that neurons of the orbitofrontal cortex get activated for expectation of immediate reward. Therefore they are the key reward structure in the brain. It was also shown that neurons in the anterior cingulate cortex work as an early warning system that prevents repeating mistakes. This paper introduces an extended model of reinforcement learning in the cortex-basal ganglia network by the hypothetical involvement of two cortical regions, the orbitofrontal cortex and the anterior cingulate cortex. In order to prove the effectiveness of the approach, we propose an enhanced actor-critic method that is guided by experiences of success and failure. Failures help the agent to explore regions by avoiding past mistakes. Successful experiences allow to exploit those regions that guarantee the agent to reach its goal. First, the method was applied to a 2-D grid problem, where an agent had to reach its goal by avoiding obstacles in its path. Second, the proposed RL model was used to optimize the learning policy of how to play bowling by the NAO humanoid robot. The results showed significant improvement using the enhanced actor-critic method both in terms of performance and rate of learning compared with the standard actor-critic method.
  • Keywords
    learning (artificial intelligence); 2D grid problem; NAO humanoid robot; RL model; actor-critic method; agent; anterior cingulate cortex; brain; cortex-basal ganglia network; cortical regions; cortical-basal inspired reinforcement learning model; early warning system; learning policy; neurocognitive studies; orbitofrontal cortex; success-failure experience; Brain modeling; Computational modeling; Encoding; Joints; Learning (artificial intelligence); Pins; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on
  • Conference_Location
    Genoa
  • Type

    conf

  • DOI
    10.1109/DEVLRN.2014.6982996
  • Filename
    6982996