Title :
Hybrid Kernel Machine Ensemble for Imbalanced Data Sets
Author :
Li, Peng ; Chan, Kap Luk ; Fang, Wen
Author_Institution :
Biomed. Eng. Res. Center, Nanyang Technol. Univ., Singapore
Abstract :
A two-class imbalanced data problem (IDP) emerges when the data from majority class are compactly clustered and the data from minority class are scattered. Though a discriminative binary support vector machine (SVM) can be trained by manually balancing the data, its performance is usually poor due to the inadequate representation of the minority class. A recognition-based one-class SVM can be trained using the data from the well-represented class only. However, it is not highly discriminative. Exploiting the complementary natures of the two types of SVMs in an ensemble can bring benefits from both worlds in addressing the IDP. Experimental results on both artificial and real benchmark data sets support the feasibility of our proposed method
Keywords :
pattern recognition; support vector machines; data balancing; discriminative binary support vector machine; hybrid kernel machine ensemble; imbalanced data problem; imbalanced data set; Biomedical engineering; Costs; Kernel; Machine learning; Morphology; Patient monitoring; Pattern recognition; Scattering; Support vector machines; Training data;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.643