Title :
Mining Causal Outliers Using Gaussian Bayesian Networks
Author :
Babbar, S. ; Chawla, Sanjay
Author_Institution :
Sch. of Inf. Technol., Univ. of Sydney, Sydney, NSW, Australia
Abstract :
Outliers are often identified as data points which are "rare\´\´, "isolated\´\´, or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense "genuine\´\´ as they reveal new information about the underlying data generating process.
Keywords :
Gaussian processes; belief networks; causality; data mining; Gaussian Bayesian networks; causal outliers mining; causal relationships; data generating process; data points; nearest neighbours; Bayes methods; Data mining; Educational institutions; Equations; Mathematical model; Standards; Training data; Bayesian networks; Causality and Outliers;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4799-0227-9
DOI :
10.1109/ICTAI.2012.22