Title :
A novel feature selection method based on CFS in cancer recognition
Author :
Lu, Xinguo ; Peng, Xianghua ; Liu, Ping ; Deng, Yong ; Feng, Bingtao ; Liao, Bo
Author_Institution :
Sch. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
Abstract :
In recent years, the gene expression profiles are used for cancer recognition. But the researchers are disturbed by their large variables and small observes. In this paper, a novel feature selection method based on correlation-based feature selection(CFS) was proposed. Firstly, the measures of variable to variable and variable to observe were calculated respectively. Then we utilized heuristic search method to search the space of variable for selecting informative gene subset and the subset weight was computed using these measures. Through regression we obtained a subset of distinguished genes. Finally, the stratified sampling strategy was presented to obtain the most informative genes. And classification performance was tested to evaluate the proposed method. Ten-fold cross-validation experiment was performed in three datasets including leukemia, colon cancer and prostate tumor. The experimental results show that the proposed method can obtain the distinguished gene subset and different classifier can acquire better classification performance with this subset.
Keywords :
cancer; correlation methods; data analysis; medical computing; regression analysis; sampling methods; search problems; CFS; cancer recognition; classification performance; colon cancer; correlation-based feature selection; datasets; gene expression profiles; heuristic search method; informative gene subset selection; leukemia; prostate tumor; regression method; stratified sampling strategy; subset weight; Accuracy; Cancer; Colon; Gene expression; Niobium; Principal component analysis; Support vector machines;
Conference_Titel :
Systems Biology (ISB), 2012 IEEE 6th International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4673-4396-1
Electronic_ISBN :
978-1-4673-4397-8
DOI :
10.1109/ISB.2012.6314141