DocumentCode :
2470826
Title :
Acquiring a broad range of empirical knowledge in real time by temporal-difference learning
Author :
Modayil, Joseph ; White, Adam ; Pilarski, Patrick M. ; Sutton, Richard S.
Author_Institution :
Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB, Canada
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
1903
Lastpage :
1910
Abstract :
Several robot capabilities rely on predictions about the temporally extended consequences of a robot´s behaviour. We describe how a robot can both learn and make many such predictions in real time using a standard algorithm. Our experiments show that a mobile robot can learn and make thousands of accurate predictions at 10 Hz. The predictions are about the future of all of the robot´s sensors and many internal state variables at multiple time-scales. All the predictions share a single set of features and learning parameters. We demonstrate the generality of this method with an application to a different platform, a robot arm operating at 50 Hz. Here, learned predictions can be used to measurably improve the user interface. The temporally extended predictions learned in real time by this method constitute a basic form of knowledge about the dynamics of the robot´s interaction with the environment. We also show how this method can be extended to express more general forms of knowledge.
Keywords :
control engineering computing; knowledge acquisition; learning (artificial intelligence); manipulators; sensors; user interfaces; empirical knowledge acquisition; frequency 10 Hz; frequency 50 Hz; learning parameter; robot arm; robot behaviour; robot capability; robot interaction; robot sensor; temporal-difference learning; user interface; Prediction algorithms; Real-time systems; Robot sensing systems; Strips; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
Type :
conf
DOI :
10.1109/ICSMC.2012.6378016
Filename :
6378016
Link To Document :
بازگشت