DocumentCode
233201
Title
Operant conditioning learning model based on BP network
Author
Huang Jing ; Ruan Xiaogang ; Li Lei ; Wei Ruoyan ; Fan Qingwu ; Wu Xuan
Author_Institution
Inst. of Artificial Intell. & Robot., Beijing Univ. of Technol., Beijing, China
fYear
2014
fDate
28-30 July 2014
Firstpage
8386
Lastpage
8390
Abstract
The naissance of cognitive robotics marks that psychology is more and more highly involved in the artificial intelligence research. Inspired by psychology and ethology, we propose an operant conditioning learning model based on BP (back-propagation) network named OCLMBP on the basis of Skinner´s relevant theory. The model is applied to the problem of obstacle avoidance with a wheeled robot. The robot controlled by the model can learn to avoid obstacles through a learning-by-doing style without any external supervision, but by the proximity sensors information as positive or negative reinforcement signals. The results are compared with original OCLM (operant conditioning learning model), and the proposed model has better performance.
Keywords
backpropagation; collision avoidance; learning (artificial intelligence); mobile robots; neurocontrollers; BP network; OCLMBP; Skinner relevant theory; artificial intelligence research; back-propagation network; cognitive robotics; ethology; learning-by-doing style; negative reinforcement signals; obstacle avoidance problem; operant conditioning learning model; positive reinforcement signals; proximity sensors; psychology; wheeled robot; Biological system modeling; Collision avoidance; Computational modeling; Mobile robots; Robot sensing systems; BP network; Operant conditioning; cognitive robotics; obstacle avoidance;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (CCC), 2014 33rd Chinese
Conference_Location
Nanjing
Type
conf
DOI
10.1109/ChiCC.2014.6896407
Filename
6896407
Link To Document