• DocumentCode
    3543367
  • Title

    Learning from imbalanced data using methods of sample selection

  • Author

    Chairi, I. ; Alaoui, Souad ; Lyhyaoui, Abdelouahid

  • Author_Institution
    LTILab, Abdelmalek Essaadi Univ., Tanger, Morocco
  • fYear
    2012
  • fDate
    10-12 May 2012
  • Firstpage
    254
  • Lastpage
    257
  • Abstract
    The majority of Machine Learning (ML) habitually assume that the training sets used for learning are balanced. However, in real world application this hypothesis is not always true. The problem of between-class imbalance is a challenge that has attracted growing attention from both academia and industry because of his critical influence on the performance of machine learning. Many solutions are proposed to resolve this problem: Generally, the common practice for dealing with imbalanced data sets is to rebalance them artificially by using sampling methods. On the other hand, researches show that Sample Selection (SS) methods help to improve the accuracy during the learning process. The main idea of our work is to apply a technique of Sample Selection on the majority class to achieve an undersampling for the imbalanced data. This procedure consent to deal with the imbalance problem and to improve the performance of learning.
  • Keywords
    data handling; learning (artificial intelligence); sampling methods; ML; SS methods; between-class imbalance; imbalanced data learning; learning process; machine learning; majority class; sample selection method; sampling methods; training sets; Accuracy; Artificial neural networks; Classification algorithms; IEEE transactions; Measurement uncertainty; Presses; Imbalanced data; Multi-Layer Perceptron; sample selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Computing and Systems (ICMCS), 2012 International Conference on
  • Conference_Location
    Tangier
  • Print_ISBN
    978-1-4673-1518-0
  • Type

    conf

  • DOI
    10.1109/ICMCS.2012.6320291
  • Filename
    6320291