DocumentCode :
1903449
Title :
Mining Causal Outliers Using Gaussian Bayesian Networks
Author :
Babbar, S. ; Chawla, Sanjay
Author_Institution :
Sch. of Inf. Technol., Univ. of Sydney, Sydney, NSW, Australia
Volume :
1
fYear :
2012
fDate :
7-9 Nov. 2012
Firstpage :
97
Lastpage :
104
Abstract :
Outliers are often identified as data points which are "rare\´\´, "isolated\´\´, or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense "genuine\´\´ as they reveal new information about the underlying data generating process.
Keywords :
Gaussian processes; belief networks; causality; data mining; Gaussian Bayesian networks; causal outliers mining; causal relationships; data generating process; data points; nearest neighbours; Bayes methods; Data mining; Educational institutions; Equations; Mathematical model; Standards; Training data; Bayesian networks; Causality and Outliers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location :
Athens
ISSN :
1082-3409
Print_ISBN :
978-1-4799-0227-9
Type :
conf
DOI :
10.1109/ICTAI.2012.22
Filename :
6495034
Link To Document :
بازگشت