• DocumentCode
    2063355
  • Title

    Comparing SVM ensembles for imbalanced datasets

  • Author

    Bhatnagar, Vasudha ; Bhardwaj, Manju ; Mahabal, Ashish

  • Author_Institution
    Dept. of Comput. Sci., Delhi Univ., Delhi, India
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 1 2010
  • Firstpage
    651
  • Lastpage
    657
  • Abstract
    Real life datasets often suffer from the problem of class imbalance, which thwarts supervised learning process. In such data sets examples of positive (minority) class are significantly less than those of negative (majority) class leading to severe class imbalance. Constructing high quality classifiers for such imbalanced training data sets is one of the major challenges in machine learning, since traditional classification algorithms tend to get biased towards majority class. In this paper, we compare three ensemble based approaches for handling imbalanced datasets. All the three approaches aim to overcome the under representation of minority class by learning from each of the minority class samples and a subset of majority class samples. The three approaches namely, PARTEN, UMjC and LFM were evaluated on several public datasets. Precision, recall, F- measure, g-mean and ROC space measures were used for comparison. Thread-bare discussion of the results is presented in the paper. Subsequently, we present an astronomy application, where the three methods are compared for prediction of class II, IIn and IIp supernovae.
  • Keywords
    data handling; learning (artificial intelligence); pattern classification; support vector machines; F-measure; LFM; PARTEN; ROC space measure; SVM ensemble; UMjC; high quality classifier; imbalanced dataset handling; imbalanced training data set; machine learning; supervised learning; Class imbalance; Classification; Ensembles; SVM; Supervised Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-8134-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2010.5687191
  • Filename
    5687191