• DocumentCode
    167913
  • Title

    DBBoost-Enhancing Imbalanced Classification by a Novel Ensemble Based Technique

  • Author

    Chunkai Zhang ; Pengfei Jia

  • Author_Institution
    Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen, China
  • fYear
    2014
  • fDate
    May 30 2014-June 1 2014
  • Firstpage
    210
  • Lastpage
    215
  • Abstract
    Classification with imbalanced data-sets has become one of the most popular issues in machine learning, since it prevails in various applications. For binary-class problem, the amount of instances from the majority class is significant larger than that from the minority class. Consequently, traditional classifiers achieve a better performance over the majority class, while unsatisfactory predictive accuracy over the minority class. The emergence of ensemble learning provides a possible solution of solving this concern. And there are many recent researches indicate that the combination of Boosting and/or Bagging with pre-processing techniques is an effective way to enhance the classification performance of imbalanced data-sets. Centered on binary-class imbalanced problem, to overcome the drawbacks of state-of-the-art approaches, this paper introduces a novel technique (DBBoost) based on the combination of AdaBoost with an adaptive sampling approach. Through supporting by statistical analysis, experiments show that DBBoost outperforms the state-of-the-art methods based on ensemble.
  • Keywords
    image classification; image sampling; learning (artificial intelligence); medical image processing; statistical analysis; AdaBoost; Boosting-Bagging combination; DBBoost; adaptive sampling approach; binary-class imbalanced problem; ensemble learning; imbalanced classification enhancement; imbalanced datasets; machine learning; novel ensemble based technique; preprocessing techniques; statistical analysis; traditional classifiers; unsatisfactory predictive accuracy; Accuracy; Bagging; Boosting; Classification algorithms; Electronic mail; Measurement; Standards; Classification; Ensemble learning; Imbalanced data-sets; Sampling approach;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Medical Biometrics, 2014 International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4014-1
  • Type

    conf

  • DOI
    10.1109/ICMB.2014.45
  • Filename
    6845852