Title :
Alleviating class imbalance problem in data mining
Author :
Sarmanova, Akkenzhe ; Albayrak, Sahin
Author_Institution :
Bigisayar Muhendisligi, Yildiz Teknik Univ., İstanbul, Turkey
Abstract :
The class imbalance problem in two-class data sets is one of the most important problems. When samples of one class in a training data set vastly outnumber samples of the other class, standard machine learning algorithms tend to be overwhelmed by the majority class and ignore the minority class. There are several algorithms to alleviate the problem of class imbalance in literature. In this paper experiments have been done comparing the existing algorithms with each other and the algorithm which has the best performance tried to be found.
Keywords :
data mining; learning (artificial intelligence); class imbalance problem; data mining; minority class; standard machine learning algorithms; training data set; two-class data sets; Boosting; Breast; Classification algorithms; Ionosphere; Machine learning algorithms; Nickel; Support vector machines; binary classification; boosting; class imbalance; resampling;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2013 21st
Conference_Location :
Haspolat
Print_ISBN :
978-1-4673-5562-9
Electronic_ISBN :
978-1-4673-5561-2
DOI :
10.1109/SIU.2013.6531574