DocumentCode
250731
Title
A connectionist actor-critic algorithm for faster learning and biological plausibility
Author
Johard, Leonard ; Ruffaldi, Emanuele
Author_Institution
Scuola Superiore S. Anna, PERCRO, Pisa, Italy
fYear
2014
fDate
May 31 2014-June 7 2014
Firstpage
3903
Lastpage
3909
Abstract
We propose a novel biologically plausible actor-critic algorithm using policy gradients in order to achieve practical, model-free reinforcement learning. It does not rely on backpropagation and is the first neural actor-critic relying only on locally available information. We show it has an advantage over pure policy gradients methods for motor learning performance in the polecart problem. We are also able to closely simulate the dopaminergic signaling patterns in rats when confronted with a two cue problem, showing that local, connectionist models can effectively model the functioning of the intrinsic reward system.
Keywords
biology computing; gradient methods; learning (artificial intelligence); neural nets; biologically plausible actor-critic algorithm; connectionist actor-critic algorithm; dopaminergic signaling patterns; intrinsic reward system; model-free reinforcement learning; neural actor-critic; polecart problem; policy gradients; Backpropagation; Biological system modeling; Learning (artificial intelligence); Neurons; Supervised learning; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location
Hong Kong
Type
conf
DOI
10.1109/ICRA.2014.6907425
Filename
6907425
Link To Document