DocumentCode
2513930
Title
Ensemble Learning Based on Active Example Selection for Solving Imbalanced Data Problem in Biomedical Data
Author
Lee, Min Su ; Oh, Sangyoon ; Zhang, Byoung-Tak
Author_Institution
Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
fYear
2009
fDate
1-4 Nov. 2009
Firstpage
350
Lastpage
355
Abstract
The imbalanced data problem is popular in biomedical classification tasks. Since trained classifiers using imbalanced data are mostly derived from the majority class, their prediction performance is poor for the minority class. In this paper, we propose a novel ensemble learning method based on an active example selection algorithm to resolve the imbalanced data problem. To compensate a possible sub-optimal classifier, our proposed ensemble learning methods aggregates classifiers built by the active example selection algorithm. We implement this ensemble learning method based on the active example selection algorithm using incremental naive Bayes classifiers. Our empirical results show that we greatly improve the performance of classification models trained by five real world imbalanced biomedical data. The proposed ensemble learning methods outperforms other approaches by 0.03~0.15 in terms of AUC which solve imbalanced data problem.
Keywords
Bayes methods; learning (artificial intelligence); medical computing; pattern classification; biomedical data classification; ensemble learning; example selection algorithm; imbalanced data problem; incremental naive Bayes classifier; prediction performance; trained classifier; Bioinformatics; Biomedical engineering; Computational efficiency; Computer science; Data engineering; Iterative algorithms; Learning systems; Machine learning algorithms; Sampling methods; Training data; Active example selectioin; Ensemble learning; Imbalanced data problem; Incremental naive Bayes;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location
Washington, DC
Print_ISBN
978-0-7695-3885-3
Type
conf
DOI
10.1109/BIBM.2009.44
Filename
5341761
Link To Document