Title :
AESNB: Active Example Selection with Naïve Bayes Classifier for Learning from Imbalanced Biomedical Data
Author :
Lee, Min Su ; Rhee, Je-Keun ; Kim, Byoung-Hee ; Zhang, Byoung-Tak
Author_Institution :
Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Abstract :
Various real-world biomedical classification tasks suffer from the imbalanced data problem which tends to make the prediction performance of some classes significantly decrease. In this paper, we present an active example selection method with naiumlve Bayes classifier (AESNB) as a solution for the imbalanced data problem. The proposed method starts with a small balanced subset of training examples. A naive Bayes classifier is trained incrementally by actively selecting and adding informative examples regardless of the original class distribution. Informative examples are defined as examples that produce high error scores by the current classifier. We examined the performance of AESNB algorithm by using five imbalanced biomedical datasets. Our experimental results show that the naiumlve Bayes classifier with our active example selection method achieves a competitive classification performance compared to the classifier with sampling or cost-sensitive methods.
Keywords :
Bayes methods; data mining; learning (artificial intelligence); medical computing; pattern classification; AESNB algorithm; active example selection; biomedical classification; data mining; imbalanced biomedical data; imbalanced learning problem; naive Bayes classifier; Bioinformatics; Biomedical engineering; Computer science; Costs; Data engineering; Data mining; Data privacy; Diseases; Proteins; Sampling methods; active example selection; cost-senseitive learning; imbalanced data problem; naïve Bayes classifier; resampling;
Conference_Titel :
Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-0-7695-3656-9
DOI :
10.1109/BIBE.2009.63