Title of article :
Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection
Author/Authors :
Avila، نويسنده , , Luis and Martيnez، نويسنده , , Ernesto، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
19
From page :
6327
To page :
6345
Abstract :
The increasing trend towards delegating tasks to autonomous artificial agents in safety–critical socio-technical systems makes monitoring an action selection policy of paramount importance. Agent behavior monitoring may profit from a stochastic specification of an optimal policy under uncertainty. A probabilistic monitoring approach is proposed to assess if an agent behavior (or policy) respects its specification. The desired policy is modeled by a prior distribution for state transitions in an optimally-controlled stochastic process. Bayesian surprise is defined as the Kullback–Leibler divergence between the state transition distribution for the observed behavior and the distribution for optimal action selection. To provide a sensitive on-line estimation of Bayesian surprise with small samples twin Gaussian processes are used. Timely detection of a deviant behavior or anomaly in an artificial pancreas highlights the sensitivity of Bayesian surprise to a meaningful discrepancy regarding the stochastic optimal policy when there exist excessive glycemic variability, sensor errors, controller ill-tuning and infusion pump malfunctioning. To reject outliers and leave out redundant information, on-line sparsification of data streams is proposed.
Keywords :
Bayesian surprise , Behavior monitoring , Kullback–Leibler divergence , artificial pancreas , Optimal action selection
Journal title :
Expert Systems with Applications
Serial Year :
2014
Journal title :
Expert Systems with Applications
Record number :
2355083
Link To Document :
بازگشت