DocumentCode
1903449
Title
Mining Causal Outliers Using Gaussian Bayesian Networks
Author
Babbar, S. ; Chawla, Sanjay
Author_Institution
Sch. of Inf. Technol., Univ. of Sydney, Sydney, NSW, Australia
Volume
1
fYear
2012
fDate
7-9 Nov. 2012
Firstpage
97
Lastpage
104
Abstract
Outliers are often identified as data points which are "rare\´\´, "isolated\´\´, or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense "genuine\´\´ as they reveal new information about the underlying data generating process.
Keywords
Gaussian processes; belief networks; causality; data mining; Gaussian Bayesian networks; causal outliers mining; causal relationships; data generating process; data points; nearest neighbours; Bayes methods; Data mining; Educational institutions; Equations; Mathematical model; Standards; Training data; Bayesian networks; Causality and Outliers;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location
Athens
ISSN
1082-3409
Print_ISBN
978-1-4799-0227-9
Type
conf
DOI
10.1109/ICTAI.2012.22
Filename
6495034
Link To Document