DocumentCode :
3777313
Title :
Gene selection for cancer clustering analysis based on expression data
Author :
Taosheng Xu; Ning Su; Rujing Wang; Liangtu Song
Author_Institution :
Department of Automation, University of Science and Technology of China, Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, China
Volume :
1
fYear :
2015
Firstpage :
516
Lastpage :
519
Abstract :
Although the genomics data are accumulated in an exponential growth, the molecular complexity of cancer is still hard to understand. The most remarkable characteristics of the genomic data are severely high-dimensional features with a small number of samples, such as gene expression data. The traditional data mining method has a limited ability to process these asymmetry datasets. In order to select the key genes from the high-dimensional gene expression data for cancer clustering analysis, gene selection based on proportional hazard model is applied in this paper. The proportional hazard model is a statistical approach used for survival risk analysis. The significant genes are selected for clustering analysis based on gene expression dataset. We demonstrate the effectiveness of this method on breast cancer and lung cancer. The experiments show a better cancer clustering result of separating of samples into distinct subclasses.
Keywords :
"Cancer","Hazards","Gene expression","Genomics","Bioinformatics","Analytical models","Data mining"
Publisher :
ieee
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2015 4th International Conference on
Type :
conf
DOI :
10.1109/ICCSNT.2015.7490801
Filename :
7490801
Link To Document :
بازگشت