Title :
Boosting Evolutionary Support Vector Machine for Designing Tumor Classifiers from Microarray Data
Author :
Huang, Hui-Ling ; Chen, Yi-Hsiung ; Koeberl, Dwight D. ; Ho, Shinn-Ying
Author_Institution :
Dept. of Inf. Manage., Jin Wen Inst. of Technol., Taipei
Abstract :
Since there are multiple sets of relevant genes having the same high accuracy in fitting training data called model uncertainty, to identify a small set of informative genes from microarray data for designing an accurate tumor classifier for unknown samples is intractable. Support vector machine (SVM), a supervised machine learning technique, is one of the methods successfully applied to cancer diagnosis problems. This study proposes an SVM-based classifier with automatic feature selection associated with a boosting strategy. The proposed boosting evolutionary support vector machine (named BESVM) hybridizes the advantages of SVM, boosting using a majority-voting ensemble and an intelligent genetic algorithm for gene selection. The merits of the BESVM-based classifier are threefold: 1) a small set of used genes, 2) accurate test classification using leave-one-out cross-validation, and 3) robust performance by avoiding overfitting training data. Five benchmark datasets were used to evaluate the BESVM-based classifier. Simulation results reveal that BESVM performs well having a mean accuracy 94.26% using only 10.1 genes averagely, compared with the existing SVM and non-SVM based classifiers
Keywords :
biology computing; genetic algorithms; pattern classification; support vector machines; tumours; automatic feature selection; boosting evolutionary support vector machine; gene selection; intelligent genetic algorithm; majority-voting ensemble; microarray data; model uncertainty; supervised machine learning; tumor classifiers; Boosting; Cancer; Genetic algorithms; Learning systems; Machine learning; Neoplasms; Support vector machine classification; Support vector machines; Training data; Uncertainty;
Conference_Titel :
Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0710-9
DOI :
10.1109/CIBCB.2007.4221201