DocumentCode
2218607
Title
KernelADASYN: Kernel based adaptive synthetic data generation for imbalanced learning
Author
Tang, Bo ; He, Haibo
Author_Institution
Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881, USA
fYear
2015
fDate
25-28 May 2015
Firstpage
664
Lastpage
671
Abstract
In imbalanced learning, most standard classification algorithms usually fail to properly represent data distribution and provide unfavorable classification performance. More specifically, the decision rule of minority class is usually weaker than majority class, leading to many misclassification of expensive minority class data. Motivated by our previous work ADASYN [1], this paper presents a novel kernel based adaptive synthetic over-sampling approach, named KernelADASYN, for imbalanced data classification problems. The idea is to construct an adaptive over-sampling distribution to generate synthetic minority class data. The adaptive over-sampling distribution is first estimated with kernel density estimation methods and is further weighted by the difficulty level for different minority class data. The classification performance of our proposed adaptive over-sampling approach is evaluated on several real-life benchmarks, specifically on medical and healthcare applications. The experimental results show the competitive classification performance for many reallife imbalanced data classification problems.
Keywords
Accuracy; Estimation; Kernel; Measurement; Sampling methods; Standards; Training data; Imbalanced learning; adaptive over-sampling; kernel density estimation; medical and healthcare data learning; pattern recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2015 IEEE Congress on
Conference_Location
Sendai, Japan
Type
conf
DOI
10.1109/CEC.2015.7256954
Filename
7256954
Link To Document