DocumentCode
3543367
Title
Learning from imbalanced data using methods of sample selection
Author
Chairi, I. ; Alaoui, Souad ; Lyhyaoui, Abdelouahid
Author_Institution
LTILab, Abdelmalek Essaadi Univ., Tanger, Morocco
fYear
2012
fDate
10-12 May 2012
Firstpage
254
Lastpage
257
Abstract
Most Machine Learning (ML) methods typically assume that the training sets used for learning are balanced. However, in real-world applications this assumption does not always hold. The problem of between-class imbalance is a challenge that has attracted growing attention from both academia and industry because of its critical influence on the performance of machine learning. Many solutions have been proposed to resolve this problem; the common practice for dealing with imbalanced data sets is to rebalance them artificially using sampling methods. In addition, research shows that Sample Selection (SS) methods help to improve accuracy during the learning process. The main idea of our work is to apply a Sample Selection technique to the majority class in order to undersample the imbalanced data. This procedure makes it possible to deal with the imbalance problem and to improve learning performance.
Keywords
data handling; learning (artificial intelligence); sampling methods; ML; SS methods; between-class imbalance; imbalanced data learning; learning process; machine learning; majority class; sample selection method; training sets; Accuracy; Artificial neural networks; Classification algorithms; IEEE transactions; Measurement uncertainty; Presses; Imbalanced data; Multi-Layer Perceptron; sample selection
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia Computing and Systems (ICMCS), 2012 International Conference on
Conference_Location
Tangier
Print_ISBN
978-1-4673-1518-0
Type
conf
DOI
10.1109/ICMCS.2012.6320291
Filename
6320291
Link To Document