DocumentCode :
3637138
Title :
Improving classification with cost-sensitive approach and Support Vector Machine
Author :
Maria Muntean;Ioan Ileană;Corina Rotar;Honoriu Vălean
Author_Institution :
Computer Science Department, 1 Decembrie 1918 University of Alba Iulia, Romania
fYear :
2010
Firstpage :
180
Lastpage :
185
Abstract :
A problem arises in data mining, when classifying unbalanced datasets using Support Vector Machines. Because of the uneven distribution and the soft margin of the classifier, the algorithm tries to improve the general accuracy of classifying a dataset, and in this process it might misclassify a lot of weakly represented classes, confusing their class instances as overshoot values that appear in the dataset, and thus ignoring them. This paper introduces the Enhancer, a new algorithm that improves the Cost-sensitive classification for Support Vector Machines, by multiplying in the training step the instances of the underrepresented classes. We have discovered that by oversampling the instances of the class of interest, we are helping the Support Vector Machine algorithm to overcome the soft margin. As an effect, it classifies better future instances of this class of interest.
Keywords :
"Support vector machines","Support vector machine classification","Kernel","Statistical learning","Automation","Computer science","Data mining","Testing","Lagrangian functions","Machine learning algorithms"
Publisher :
ieee
Conference_Titel :
Roedunet International Conference (RoEduNet), 2010 9th
ISSN :
2068-1038
Print_ISBN :
978-1-4244-7335-9
Type :
conf
Filename :
5541578
Link To Document :
بازگشت