DocumentCode :
1475305
Title :
Improved Binocular Vergence Control via a Neural Network That Maximizes an Internally Defined Reward
Author :
Wang, Yiwen ; Shi, Bertram E.
Author_Institution :
Qiushi Acad. for Adv. Studies, Zhejiang Univ., Hangzhou, China
Volume :
3
Issue :
3
fYear :
2011
Firstpage :
247
Lastpage :
256
Abstract :
We describe the autonomous development of binocular vergence control in an active robotic vision system through attention-gated reinforcement learning (AGREL). The control policy is implemented by a neural network, which maps the outputs from a population of disparity energy neurons to a set of vergence commands. The network learns to maximize a reward signal that is based on an internal representation of the visual input: the total activation in the population of disparity energy neurons. This system extends previous work using Q learning by increasing the complexity of the policy in two ways. First, the input state space is continuous, rather than discrete, and is based upon a larger diversity of neurons. Second, we increase the number of possible actions. We evaluate the network learning and performance on natural images and with real objects in a cluttered environment. The policies learned by the network outperform policies by Q learning in two ways: the mean squared errors are smaller and the closed loop frequency response has larger bandwidth.
Keywords :
learning (artificial intelligence); neurocontrollers; robot vision; state-space methods; Q learning; active robotic vision system; attention-gated reinforcement learning; binocular vergence control; closed loop frequency response; disparity energy neurons; input state space; internally defined reward maximization; mean squared errors; network learning; neural network; Artificial neural networks; Cameras; Control systems; Learning; Neurons; Pixel; Visualization; Attention-gated reinforcement learning; autonomous vergence control; disparity energy neuron;
fLanguage :
English
Journal_Title :
Autonomous Mental Development, IEEE Transactions on
Publisher :
ieee
ISSN :
1943-0604
Type :
jour
DOI :
10.1109/TAMD.2011.2128318
Filename :
5734799
Link To Document :
بازگشت