Title :
Gene Selection for Cancer Classification using Wilcoxon Rank Sum Test and Support Vector Machine
Author :
Liao, Chen ; Li, Shutao ; Luo, Zhiyuan
Author_Institution :
Coll. of Electr. & Inf. Eng., Hunan Univ., Changsha
Abstract :
Gene selection is an important problem in microarray data processing. A new gene selection method based on Wilcoxon rank sum test and support vector machine (SVM) is proposed in this paper. First, Wilcoxon rank sum test is used to select a subset. Then each selected gene is trained and tested using SVM classifier with linear kernel separately, and genes with high testing accuracy rates are chosen to form the final reduced gene subset. Leave-one-out cross validation (LOOCV) classification results on two datasets: breast cancer and ALL/AML leukemia, demonstrate the proposed method can get 100% success rate with final reduced subset. The selected genes are listed and their expression levels are sketched, which show that the selected genes can make clear separation between two classes
Keywords :
biology computing; cancer; genetics; pattern classification; support vector machines; SVM classifier; Wilcoxon rank sum test; cancer classification; gene selection; leave-one-out cross validation classification; microarray data processing; support vector machine; Breast cancer; Computer science; Data engineering; Data processing; Educational institutions; Entropy; Kernel; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Computational Intelligence and Security, 2006 International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
1-4244-0605-6
Electronic_ISBN :
1-4244-0605-6
DOI :
10.1109/ICCIAS.2006.294156