DocumentCode :
2769420
Title :
Probability density function estimation based over-sampling for imbalanced two-class problems
Author :
Gao, Ming ; Hong, Xia ; Chen, Sheng ; Harris, Chris J.
Author_Institution :
Sch. of Syst. Eng., Univ. of Reading, Reading, UK
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
1
Lastpage :
8
Abstract :
A novel probability density function (PDF) estimation based over-sampling approach is proposed for two-class imbalanced classification problems. The Parzen-window kernel function is applied to estimate the PDF of the positive class, from which synthetic instances are generated as additional training data to re-balance the class distribution. Utilising the re-balanced over-sampled training data, a radial basis function (RBF) classifier is constructed by applying an orthogonal forward regression, in which the classifier´s structure and the parameters of RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed approach is demonstrated by an empirical study on several imbalanced data sets.
Keywords :
estimation theory; pattern classification; probability; radial basis function networks; regression analysis; PDF estimation; Parzen-window kernel function; RBF classifier; imbalanced classification; imbalanced two-class problems; orthogonal forward regression; over-sampling approach; probability density function estimation; radial basis function; training data; Covariance matrix; Estimation; Kernel; Probability density function; Standards; Training data; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
ISSN :
2161-4393
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
Type :
conf
DOI :
10.1109/IJCNN.2012.6252384
Filename :
6252384
Link To Document :
بازگشت