Title :
Lasso based gene selection for linear classifiers
Author :
Zheng, Songfeng ; Liu, Weixiang
Author_Institution :
Dept. of Math., Missouri State Univ., Springfield, MO, USA
Abstract :
Selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso is known to have automatic variable selection ability in linear regression analysis. This paper uses Lasso to select most informative genes to represent the class label as a linear function of the gene expression data. The selected genes are further used to fit linear classifiers for tumor classification. The proposed approach (gene selection and linear classification) was applied to 5 publicly available cancer datasets. Compared to other methods in literature, the proposed method achieves similar or higher classification accuracy with fewer genes.
Keywords :
biology computing; genetics; regression analysis; set theory; Lasso-based gene selection; automatic variable selection ability; gene expression data; gene subset; linear classification; linear regression analysis; tumor classification; Biomedical engineering; Cancer; Gene expression; Input variables; Linear regression; Mathematics; Neoplasms; Support vector machine classification; Support vector machines; Testing; Lasso; cross validation; leave-one-out; linear classifier; variable selection;
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
DOI :
10.1109/BIBMW.2009.5332127