• DocumentCode
    1805211
  • Title

    Improved SMOTEBagging and its application in imbalanced data classification

  • Author

    Zhang Yongqing ; Zhu Min ; Zhang Danling ; Mi Gang ; Ma Daichuan

  • Author_Institution
    School of Computer Science, Sichuan University, Chengdu, China
  • fYear
    2013
  • fDate
    1-8 Jan. 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Many real world data mining applications involve imbalanced data sets, When all kinds of data are unevenly distributed and the particular evens of interest may be very few when compared to the other class. Data sets that contain rare evens usually produces biased classifiers that have a higher predictive accuracy over the majority class, but poorer predictive accuracy over the minority class of interest. This paper presents a novel ensemble algorithm with improved SMOTE, and combines selective ensemble with Bagging, which balances the class distribution with improved SMOTEBagging algorithm. Experiments on four UCI data sets and protein-protein interaction experiments mentioned above prove the performance of the method.
  • Keywords
    Bioinformatics; Classification algorithms; Proteins; Support vector machine classification; Tin; Bagging; Imbalanced Datasets; SMOTE; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Conference Anthology, IEEE
  • Conference_Location
    China
  • Type

    conf

  • DOI
    10.1109/ANTHOLOGY.2013.6784957
  • Filename
    6784957