DocumentCode :
151496
Title :
ModBoost for unbiased classification
Author :
Shah, Karan ; Gala, Sapna ; Patil, Nahush
Author_Institution :
Dept. of Inf. Technol., Univ. of Mumbai, Mumbai, India
fYear :
2014
fDate :
5-6 Sept. 2014
Firstpage :
1
Lastpage :
5
Abstract :
Many real world data mining applications involve learning from large data sets. Boosting, an ensemble-based learning algorithm, has shown to improve the performance of classifiers in many situations. In this paper, we describe a new approach that is a modified version of boosting. In the ModBoost method, the sample training set is produced from the given dataset by sampling with replacement. The model is generated from the sample training set and is evaluated for error on the dataset instead of the training set. Misclassified tuples are given a higher weight and subsequently, in the next iterations the misclassified tuples are given first preferences and resampling is done for the rest of the training set. The total weights of the different classes in the new training set are rebalanced. The ModBoost method is evaluated, in terms of the F-measures, Precision, Recall and overall accuracy against large and moderately imbalanced bank marketing data set using Naïve Bayesian as a base classifier. The results are promising and show that the ModBoost method compares well in comparison with AdaBoost and bagging algorithm. Our results prove empirically that ModBoost is an attractive alternative for improving the classification performance. We also provide suggestions for future research.
Keywords :
Bayes methods; banking; data mining; learning (artificial intelligence); marketing; pattern classification; AdaBoost; F-measures; ModBoost method; bagging algorithm; base classifier; classification performance; ensemble-based learning algorithm; imbalanced bank marketing data set; misclassified tuples; naïve Bayesian; real world data mining applications; unbiased classification; Accuracy; Bagging; Boosting; Classification algorithms; Data models; Prediction algorithms; Training; Bagging; Boosting; Classification; Data mining; Ensembles of classifiers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-4675-4
Type :
conf
DOI :
10.1109/ICDMIC.2014.6954252
Filename :
6954252
Link To Document :
بازگشت