Title :
Gene selection for leukemia subtype classification from gene expression profile
Author :
Li, Ying-Xin ; Zhu, Yun-Hua ; Ruan, Xiao-gang
Author_Institution :
Sch. of Electron. Inf. & Control Eng., Beijing Univ. of Technol., China
Abstract :
It is very important but difficult to identify which genes in gene expression data can contribute most to tumor subtype classification. An approach to select a small subset of genes for leukemia subtype classification from large scale gene expression profile is proposed in this paper. Having removed the noisy genes with little relevance to the classification task, the "sequential floating forward search" method was employed to generate candidate feature subsets consisting of informative genes, and then, a support vector machine was employed as a classifier to select the optimal feature subset with minimum classification errors. The results of our experiment showed that all the samples can be correctly classified without any error with only five genes.
Keywords :
cancer; feature extraction; genetics; pattern classification; support vector machines; tumours; classification errors; gene expression data; gene expression profile; gene selection; informative genes; leukemia subtype classification; optimal feature subset selection; sequential floating forward search method; support vector machine; tumor subtype classification; Cancer; Control engineering; Data analysis; Electronic mail; Gene expression; Large-scale systems; Neoplasms; Noise generators; Support vector machine classification; Support vector machines;
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1382042