Title :
Selecting informative genes by Lasso and Dantzig selector for linear classifiers
Author :
Zheng, Songfeng ; Liu, Weixiang
Author_Institution :
Dept. of Math., Missouri State Univ., Springfield, MO, USA
Abstract :
Automatically selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso and Dantzig selector are known to have automatic variable selection ability in linear regression analysis. This paper employs Lasso and Dantzig selector to select most informative genes for representing the class label as a linear function of gene expression data. The selected genes are further used to fit linear classifiers for cancer classification. On 3 publicly available cancer datasets, the experimental results show that in general, Lasso is more capable than Dantzig selector in selecting informative genes for classification.
Keywords :
bioinformatics; cancer; classification; data analysis; genetics; genomics; medical information systems; molecular biophysics; Dantzig selector; Lasso selector; cancer classification; data classification; datasets; gene expression; informative genes; linear regression analysis; Cancer; Error analysis; Gene expression; Input variables; Linear regression; Logistics; Support vector machines; Dantzig selector; Lasso; cancer classification; gene selection;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-8306-8
Electronic_ISBN :
978-1-4244-8307-5
DOI :
10.1109/BIBM.2010.5706651