DocumentCode
3228119
Title
Sampled Bayesian Network Classifiers for Class-Imbalance and Cost-Sensitive Learning
Author
Liangxiao Jiang ; Chaoqun Li ; Zhihua Cai ; Zhang, Haijun
Author_Institution
Dept. of Comput. Sci., China Univ. of Geosci., Wuhan, China
fYear
2013
fDate
4-6 Nov. 2013
Firstpage
512
Lastpage
517
Abstract
In many real-world applications, it is often the case that the class distribution of instances is imbalanced and the costs of misclassification are different. Thus, class-imbalance and cost-sensitive learning have attracted much attention from researchers. Sampling is one of the widely used approaches in dealing with the class imbalance problem, which alters the class distribution of instances so that the minority class is well represented in the training data. In this paper, we study the effect of sampling the natural training data on state-of-the-art Bayesian network classifiers, such as Naive Bayes (NB), Tree Augmented Naïve Bayes (TAN), Averaged One-Dependence Estimators (AODE), Weighted Average of One-Dependence Estimators (WAODE), and Hidden naive Bayes (HNB) and propose sampled Bayesian network classifiers. Our experimental results on a large number of UCI datasets show that our sampled Bayesian network classifiers perform much better than the ones trained from the natural training data especially when the natural training data is highly imbalanced and the cost ratio is high enough.
Keywords
Bayes methods; pattern classification; trees (mathematics); HNB classifier; TAN classifier; UCI datasets; WAODE classifier; averaged one-dependence estimator classifier; class distribution; class imbalance problem; cost-sensitive learning; hidden naive Bayes classifier; misclassification cost; natural training data sampling; sampled Bayesian network classifiers; tree augmented naïve Bayes classifier; weighted average of one-dependence estimator classifier; Bayes methods; Breast cancer; Educational institutions; Ionosphere; Niobium; Training; Training data; Bayesian network classifiers; class-imbalance learning; cost-sensitive learning; misclassification cost; true positive rate;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
Conference_Location
Herndon, VA
ISSN
1082-3409
Print_ISBN
978-1-4799-2971-9
Type
conf
DOI
10.1109/ICTAI.2013.82
Filename
6735293
Link To Document