DocumentCode
1413220
Title
FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning
Author
Batuwita, Rukshan ; Palade, Vasile
Author_Institution
Oxford Univ. Comput. Lab., Oxford Univ., Oxford, UK
Volume
18
Issue
3
fYear
2010
fDate
6/1/2010 12:00:00 AM
Firstpage
558
Lastpage
571
Abstract
Support vector machines (SVMs) is a popular machine learning technique, which works effectively with balanced datasets. However, when it comes to imbalanced datasets, SVMs produce suboptimal classification models. On the other hand, the SVM algorithm is sensitive to outliers and noise present in the datasets. Therefore, although the existing class imbalance learning (CIL) methods can make SVMs less sensitive to class imbalance, they can still suffer from the problem of outliers and noise. Fuzzy SVMs (FSVMs) is a variant of the SVM algorithm, which has been proposed to handle the problem of outliers and noise. In FSVMs, training examples are assigned different fuzzy-membership values based on their importance, and these membership values are incorporated into the SVM learning algorithm to make it less sensitive to outliers and noise. However, like the normal SVM algorithm, FSVMs can also suffer from the problem of class imbalance. In this paper, we present a method to improve FSVMs for CIL (called FSVM-CIL), which can be used to handle the class imbalance problem in the presence of outliers and noise. We thoroughly evaluated the proposed FSVM-CIL method on ten real-world imbalanced datasets and compared its performance with five existing CIL methods, which are available for normal SVM training. Based on the overall results, we can conclude that the proposed FSVM-CIL method is a very effective method for CIL, especially in the presence of outliers and noise in datasets.
Keywords
fuzzy set theory; learning (artificial intelligence); pattern classification; support vector machines; class imbalance learning; fuzzy support vector machines; machine learning technique; suboptimal classification models; Class imbalance learning (CIL); fuzzy support vector machines (FSVMs); outliers; support vector machines (SVMs);
fLanguage
English
Journal_Title
Fuzzy Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-6706
Type
jour
DOI
10.1109/TFUZZ.2010.2042721
Filename
5409611
Link To Document