• DocumentCode
    589305
  • Title

    The Class-Imbalance Problem for High-Dimensional Class Prediction

  • Author

    Lusa, L. ; Blagus, R.

  • Volume
    2
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Firstpage
    123
  • Lastpage
    126
  • Abstract
    The goal of class prediction studies is to develop rules to accurately predict the class membership of new subjects. The classifiers differ in the way they combine the values of the variables available for each subject. Frequently the classifiers are developed using class-imbalanced data, where the number of samples in each class is not equal. Standard classification methods used on class-imbalanced data are often biased towards the majority class: they classify most new samples in the majority class and they do not accurately predict the minority class. Data are high-dimensional when the number of variables greatly exceeds the number of subjects. In this paper we show how the high-dimensionality poses additional challenges when dealing with class-imbalanced prediction. Here we present new simulation studies for five classifiers, where we expand our previous results to correlated variables, and briefly discuss the results.
  • Keywords
    pattern classification; class membership prediction; class-imbalanced data; class-imbalanced prediction; class-imbalanced problem; classifiers; high-dimensional class prediction; high-dimensional data; Accuracy; Data models; Input variables; Radio frequency; Simulation; Support vector machines; Training; class-imbalance; classification; high-dimensional data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2012 11th International Conference on
  • Conference_Location
    Boca Raton, FL
  • Print_ISBN
    978-1-4673-4651-1
  • Type

    conf

  • DOI
    10.1109/ICMLA.2012.223
  • Filename
    6406739