DocumentCode
2477872
Title
Temporal difference learning to detect unsafe system states
Author
Ning, Huazhong ; Xu, Wei ; Zhou, Yue ; Gong, Yihong ; Huang, Thomas
Author_Institution
ECE Dept., U. of Illinois at Urbana-Champaign, Urbana, IL
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
This paper proposes a general framework to detect unsafe states of a system whose basic realtime parameters are captured by multi-sensors. Our approach is to learn a danger level function which can be used to alert the users in advance of dangerous situations. The main challenge to this learning problem is the labelling issue, i.e., it is difficult to assign an objective danger level at each time step to the training data, except at the collapse points where a penalty can be assigned and at the successful ends where a certain reward can be assigned. In this paper, we treat the danger level as expected future reward (penalty is regarded as negative reward) and use temporal difference (TD) learning [2] to learn a function to approximate the expected future reward. The TD learning obtains the approximation by propagating the penalty/reward observable at collapse points or successful ends to the entire feature space following some constraints. Our approach is applied to, but not limited to, the application of monitoring of driving safety and the experimental results demonstrate the effectiveness of the approach.
Keywords
driver information systems; sensor fusion; temporal reasoning; danger level function; driving safety; expected future reward; labelling issue; multisensors; temporal difference learning; unsafe states detect; Detectors; Labeling; Laboratories; Learning; Monitoring; National electric code; Safety; Sensor systems; Space technology; Training data; Driving Safety; Multi-sensor; Temporal Difference Learning; Unsafe System State;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761237
Filename
4761237
Link To Document