DocumentCode :
2221779
Title :
A new evolutionary gene selection technique
Author :
Lancucki, Adrian ; Saha, Indrajit ; Lipinski, Piotr
Author_Institution :
Computational Intelligence Research Group, Institute of Computer Science, University of Wroclaw, Wroclaw, Poland
fYear :
2015
fDate :
25-28 May 2015
Firstpage :
1612
Lastpage :
1619
Abstract :
Microarray technology allows to investigate gene expression levels by analyzing high dimensional datasets of few samples. Selection of discriminative, differentially expressed genes from such datasets is important to differentiate, prognose and understand the underlying biological processes. In this regard, the paper presents a new evolutionary gene selection method based on Student-t Stochastic Neighbor Embedding (t-SNE), Differential Evolution (DE) and Support Vector Machine (SVM). Here the underlying classification task of SVM is used as an optimization problem of DE, while t-SNE provides better ordering of genes for selection purpose. Generally, t-SNE is used to reorder the genes in such a way so that similar genes are grouped together and dissimilar genes are kept further apart. These reordered genes are then fragmented into fixed-length partitions. Thereafter, from each partition, a gene is selected randomly to encode the initial population of DE along with the combination of its weight and threshold values in order to participate in fitness computation. In the final generation of DE, a subset of genes is selected based on higher classification accuracy. The proposed technique is tested on six publicly available microarray datasets concerning various cancerous tissues of Homo sapiens and yields a potential set of genes by providing prefect or nearly perfect classification accuracy. Moreover, the superiority of the proposed technique has been demonstrated in comparison with other widely used techniques. Finally, the achieved results have also been justified by a statistical test and allowed us to draw biological conclusions through the identification of Gene Ontologies.
Keywords :
Accuracy; Cancer; Kernel; Lungs; Signal to noise ratio; Support vector machines; Tumors; differential evolution; gene ontology; gene selection; microarray;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2015 IEEE Congress on
Conference_Location :
Sendai, Japan
Type :
conf
DOI :
10.1109/CEC.2015.7257080
Filename :
7257080
Link To Document :
بازگشت