Title :
Support vectors based correlation coefficient for gene and sample selection in cancer classification
Author :
Mundra, Piyushkumar A. ; Rajapakse, Jagath C.
Author_Institution :
Biolnformatics Res. Center, Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Correlation is a very widely used filter criterion for gene selection in cancer classification. However, it uses all the training samples in ranking, which may not be equally important for the classification. Using support vectors, we demonstrate that classical correlation coefficient based gene selection is biased because of the sample points away from classification margin. To remove such bias, we use only the support vectors for computation of correlation coefficient and propose a backward elimination based SVcc-RFE algorithm. The proposed method is tested on several benchmark cancer gene expression datasets and the results show improvement in classification performance compared to other state-of-the-art methods.
Keywords :
bioinformatics; cancer; pattern classification; support vector machines; SVcc-RFE algorithm; backward elimination; cancer classification; correlation coefficient; gene selection; sample selection; support vectors; Benchmark testing; Cancer; Costs; DNA; Data mining; Filters; Gene expression; Partitioning algorithms; Size measurement; Support vector machines;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2010 IEEE Symposium on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4244-6766-2
DOI :
10.1109/CIBCB.2010.5510689