• DocumentCode
    3600888
  • Title

    Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems

  • Author

    Ng, Wing W. Y. ; Hu, Junjie ; Yeung, Daniel S. ; Shaohua Yin ; Roli, Fabio

  • Author_Institution
    Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
  • Volume
    45
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2402
  • Lastpage
    2412
  • Abstract
    Undersampling is a widely adopted method to deal with imbalance pattern classification problems. Current methods mainly depend on either random resampling on the majority class or resampling at the decision boundary. Random-based undersampling fails to take into consideration informative samples in the data while resampling at the decision boundary is sensitive to class overlapping. Both techniques ignore the distribution information of the training dataset. In this paper, we propose a diversified sensitivity-based undersampling method. Samples of the majority class are clustered to capture the distribution information and enhance the diversity of the resampling. A stochastic sensitivity measure is applied to select samples from both clusters of the majority class and the minority class. By iteratively clustering and sampling, a balanced set of samples yielding high classifier sensitivity is selected. The proposed method yields a good generalization capability for 14 UCI datasets.
  • Keywords
    decision making; iterative methods; pattern classification; sampling methods; sensitivity analysis; stochastic processes; UCI datasets; classifier sensitivity; decision boundary; diversified sensitivity-based undersampling method; imbalance classification problems; random resampling; random-based under-sampling; stochastic sensitivity measure; Artificial neural networks; Clustering algorithms; Neurons; Sensitivity; Support vector machines; Time complexity; Training; Diversified sensitivity undersampling (DSUS); imbalance data; sample selection;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2372060
  • Filename
    6971101