DocumentCode
67590
Title
Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data
Author
Castro, C.L. ; Braga, Antonio Padua
Author_Institution
Grad. Program in Electr. Eng., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
Volume
24
Issue
6
fYear
2013
fDate
Jun-13
Firstpage
888
Lastpage
899
Abstract
Traditional learning algorithms applied to complex and highly imbalanced training sets may not give satisfactory results when distinguishing between examples of the classes. The tendency is to yield classification models that are biased towards the overrepresented (majority) class. This paper investigates this class imbalance problem in the context of multilayer perceptron (MLP) neural networks. The consequences of the equal cost (loss) assumption on imbalanced data are formally discussed from a statistical learning theory point of view. A new cost-sensitive algorithm (CSMLP) is presented to improve the discrimination ability of (two-class) MLPs. The CSMLP formulation is based on a joint objective function that uses a single cost parameter to distinguish the importance of class errors. The learning rule extends the Levenberg-Marquadt´s rule, ensuring the computational efficiency of the algorithm. In addition, it is theoretically demonstrated that the incorporation of prior information via the cost parameter may lead to balanced decision boundaries in the feature space. Based on the statistical analysis of results on real data, our approach shows a significant improvement of the area under the receiver operating characteristic curve and G-mean measures of regular MLPs.
Keywords
learning (artificial intelligence); multilayer perceptrons; pattern classification; sensitivity analysis; statistical analysis; CSMLP formulation; G-mean measures; Levenberg-Marquadt rule; MLP neural networks; balanced decision boundaries; class imbalance problem; classification models; cost-sensitive algorithm; discrimination ability improvement; imbalanced data; joint objective function; learning rule; multilayer perceptron performance improvement; receiver operating characteristic curve; statistical analysis; statistical learning theory; Algorithm design and analysis; Cost function; Jacobian matrices; Joints; Multilayer perceptrons; Training; Vectors; Class imbalance problem; Levenberg–Marquadt; cost-sensitive approach; multilayer perceptron; objective function;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2013.2246188
Filename
6469237
Link To Document