• DocumentCode
    3228119
  • Title

    Sampled Bayesian Network Classifiers for Class-Imbalance and Cost-Sensitive Learning

  • Author

    Liangxiao Jiang ; Chaoqun Li ; Zhihua Cai ; Haijun Zhang

  • Author_Institution
    Dept. of Comput. Sci., China Univ. of Geosci., Wuhan, China
  • fYear
    2013
  • fDate
    4-6 Nov. 2013
  • Firstpage
    512
  • Lastpage
    517
  • Abstract
    In many real-world applications, the class distribution of instances is imbalanced and the costs of misclassification differ. Class-imbalance and cost-sensitive learning have therefore attracted much attention from researchers. Sampling is one of the widely used approaches for dealing with the class-imbalance problem: it alters the class distribution of instances so that the minority class is well represented in the training data. In this paper, we study the effect of sampling the natural training data on state-of-the-art Bayesian network classifiers, such as Naive Bayes (NB), Tree Augmented Naive Bayes (TAN), Averaged One-Dependence Estimators (AODE), Weighted Average of One-Dependence Estimators (WAODE), and Hidden Naive Bayes (HNB), and propose sampled Bayesian network classifiers. Our experimental results on a large number of UCI datasets show that the sampled Bayesian network classifiers perform much better than those trained on the natural training data, especially when the natural training data is highly imbalanced and the cost ratio is high enough.
  • Keywords
    Bayes methods; pattern classification; trees (mathematics); HNB classifier; TAN classifier; UCI datasets; WAODE classifier; averaged one-dependence estimator classifier; class distribution; class imbalance problem; cost-sensitive learning; hidden naive Bayes classifier; misclassification cost; natural training data sampling; sampled Bayesian network classifiers; tree augmented naïve Bayes classifier; weighted average of one-dependence estimator classifier; Breast cancer; Educational institutions; Ionosphere; Niobium; Training; Training data; Bayesian network classifiers; class-imbalance learning; true positive rate
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
  • Conference_Location
    Herndon, VA
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4799-2971-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2013.82
  • Filename
    6735293