• DocumentCode
    1784764
  • Title

    A hybrid Machine Learning methodology for imbalanced datasets

  • Author

    Lipitakis, Anastasia-Dimitra ; Kotsiantis, Sotirios

  • Author_Institution
    Dept. of Math., Univ. of Patras, Patra, Greece
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    252
  • Lastpage
    257
  • Abstract
    In the Machine Learning systems several imbalanced data sets exhibit skewed class distributions in which most cases are allocated to a class and far fewer cases to a smaller one. A classifier induced from an imbalanced data set has usually a low error rate for the majority class and an unacceptable error rate for the minority class. In this paper a synoptic review of the various related methodologies is given, a new ensemble methodology is introduced and an experimental study with other ensembles is presented. The proposed method that combines the power of OverBagging and Rotation Forest algorithms improves the identification of a difficult small class, while keeping the classification ability of the other class in an acceptable accuracy level.
  • Keywords
    learning (artificial intelligence); pattern classification; OverBagging algorithms; classifier; ensemble methodology; hybrid machine learning methodology; imbalanced datasets; low error rate; rotation forest algorithms; skewed class distributions; Accuracy; Bagging; Classification algorithms; Decision trees; Learning systems; Principal component analysis; Training; computational intelligence; ensembles of classifiers; imbalanced data sets; supervised machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/IISA.2014.6878762
  • Filename
    6878762