• DocumentCode
    1903449
  • Title

    Mining Causal Outliers Using Gaussian Bayesian Networks

  • Author

    Babbar, S. ; Chawla, Sanjay

  • Author_Institution
    Sch. of Inf. Technol., Univ. of Sydney, Sydney, NSW, Australia
  • Volume
    1
  • fYear
    2012
  • fDate
    7-9 Nov. 2012
  • Firstpage
    97
  • Lastpage
    104
  • Abstract
    Outliers are often identified as data points which are "rare\´\´, "isolated\´\´, or far away from their nearest neighbours. In this paper we demonstrate that meaningful outliers, i.e., outliers which perhaps encode important or new information are those which violate causal relationships. We first build a Bayesian network which encode causal relationships between attributes and then identify those points as outliers which violate these causal relationships. Experiments on several data sets confirm that the outliers identified in this fashion are in some sense "genuine\´\´ as they reveal new information about the underlying data generating process.
  • Keywords
    Gaussian processes; belief networks; causality; data mining; Gaussian Bayesian networks; causal outliers mining; causal relationships; data generating process; data points; nearest neighbours; Bayes methods; Data mining; Educational institutions; Equations; Mathematical model; Standards; Training data; Bayesian networks; Causality and Outliers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
  • Conference_Location
    Athens
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4799-0227-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2012.22
  • Filename
    6495034