• DocumentCode
    67590
  • Title

    Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data

  • Author

    Castro, C.L. ; Braga, Antonio Padua

  • Author_Institution
    Grad. Program in Electr. Eng., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
  • Volume
    24
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    888
  • Lastpage
    899
  • Abstract
    Traditional learning algorithms applied to complex and highly imbalanced training sets may not give satisfactory results when distinguishing between examples of the classes. The tendency is to yield classification models that are biased towards the overrepresented (majority) class. This paper investigates this class imbalance problem in the context of multilayer perceptron (MLP) neural networks. The consequences of the equal cost (loss) assumption on imbalanced data are formally discussed from a statistical learning theory point of view. A new cost-sensitive algorithm (CSMLP) is presented to improve the discrimination ability of (two-class) MLPs. The CSMLP formulation is based on a joint objective function that uses a single cost parameter to distinguish the importance of class errors. The learning rule extends the Levenberg-Marquadt´s rule, ensuring the computational efficiency of the algorithm. In addition, it is theoretically demonstrated that the incorporation of prior information via the cost parameter may lead to balanced decision boundaries in the feature space. Based on the statistical analysis of results on real data, our approach shows a significant improvement of the area under the receiver operating characteristic curve and G-mean measures of regular MLPs.
  • Keywords
    learning (artificial intelligence); multilayer perceptrons; pattern classification; sensitivity analysis; statistical analysis; CSMLP formulation; G-mean measures; Levenberg-Marquadt rule; MLP neural networks; balanced decision boundaries; class imbalance problem; classification models; cost-sensitive algorithm; discrimination ability improvement; imbalanced data; joint objective function; learning rule; multilayer perceptron performance improvement; receiver operating characteristic curve; statistical analysis; statistical learning theory; Algorithm design and analysis; Cost function; Jacobian matrices; Joints; Multilayer perceptrons; Training; Vectors; Class imbalance problem; Levenberg–Marquadt; cost-sensitive approach; multilayer perceptron; objective function;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2013.2246188
  • Filename
    6469237