DocumentCode :
2218607
Title :
KernelADASYN: Kernel based adaptive synthetic data generation for imbalanced learning
Author :
Tang, Bo ; He, Haibo
Author_Institution :
Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881, USA
fYear :
2015
fDate :
25-28 May 2015
Firstpage :
664
Lastpage :
671
Abstract :
In imbalanced learning, most standard classification algorithms usually fail to properly represent data distribution and provide unfavorable classification performance. More specifically, the decision rule of minority class is usually weaker than majority class, leading to many misclassification of expensive minority class data. Motivated by our previous work ADASYN [1], this paper presents a novel kernel based adaptive synthetic over-sampling approach, named KernelADASYN, for imbalanced data classification problems. The idea is to construct an adaptive over-sampling distribution to generate synthetic minority class data. The adaptive over-sampling distribution is first estimated with kernel density estimation methods and is further weighted by the difficulty level for different minority class data. The classification performance of our proposed adaptive over-sampling approach is evaluated on several real-life benchmarks, specifically on medical and healthcare applications. The experimental results show the competitive classification performance for many reallife imbalanced data classification problems.
Keywords :
Accuracy; Estimation; Kernel; Measurement; Sampling methods; Standards; Training data; Imbalanced learning; adaptive over-sampling; kernel density estimation; medical and healthcare data learning; pattern recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2015 IEEE Congress on
Conference_Location :
Sendai, Japan
Type :
conf
DOI :
10.1109/CEC.2015.7256954
Filename :
7256954
Link To Document :
بازگشت