DocumentCode :
2477872
Title :
Temporal difference learning to detect unsafe system states
Author :
Ning, Huazhong ; Xu, Wei ; Zhou, Yue ; Gong, Yihong ; Huang, Thomas
Author_Institution :
ECE Dept., U. of Illinois at Urbana-Champaign, Urbana, IL
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
This paper proposes a general framework to detect unsafe states of a system whose basic realtime parameters are captured by multi-sensors. Our approach is to learn a danger level function which can be used to alert the users in advance of dangerous situations. The main challenge to this learning problem is the labelling issue, i.e., it is difficult to assign an objective danger level at each time step to the training data, except at the collapse points where a penalty can be assigned and at the successful ends where a certain reward can be assigned. In this paper, we treat the danger level as expected future reward (penalty is regarded as negative reward) and use temporal difference (TD) learning [2] to learn a function to approximate the expected future reward. The TD learning obtains the approximation by propagating the penalty/reward observable at collapse points or successful ends to the entire feature space following some constraints. Our approach is applied to, but not limited to, the application of monitoring of driving safety and the experimental results demonstrate the effectiveness of the approach.
Keywords :
driver information systems; sensor fusion; temporal reasoning; danger level function; driving safety; expected future reward; labelling issue; multisensors; temporal difference learning; unsafe states detect; Detectors; Labeling; Laboratories; Learning; Monitoring; National electric code; Safety; Sensor systems; Space technology; Training data; Driving Safety; Multi-sensor; Temporal Difference Learning; Unsafe System State;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761237
Filename :
4761237
Link To Document :
بازگشت