DocumentCode :
617870
Title :
Building high level knowledge from high dimensionality biological dataset (NCI60) using Genetic Algorithms and feature selection strategies
Author :
Oliveira Silva, Reslley Gabriel ; de Souza Ribeiro, Marcos Wagner ; Rodrigues do Amaral, Laurence
Author_Institution :
Sch. of Comput., Fed. Univ. of Uberlandia, Uberlandia, Brazil
fYear :
2013
fDate :
20-23 June 2013
Firstpage :
578
Lastpage :
583
Abstract :
When it comes to complex biological problems the use of conventional computation techniques has shown not to be the best approach. With the aim of selecting small sets of genes, that have strong predictive correlations with a disease, the Genetic Algorithms (GAs) are being increasingly used. In this paper, we propose a hybrid approach, using methods of feature selection and a classifier based on GA as a tool to identifying a subset of relevant genes and developing high-level classification rules for the cancer dataset NCI60, revealing concise and relevant information about the application domain. As a result it was obtained a set of IF-THEN rules with few genes per class and high predictive power that can be used as a classifier and assist experts to understand the biologic relationship between the genes and the classes of cancer. Moreover, the accuracy of the proposed approach overcame the results obtained by traditional classification methods such as PART, J48, Naive Bayes, Random Forest and IBK, demonstrating that the rules balance interpretability, comprehensibility and prediction precision.
Keywords :
cancer; feature extraction; genetic algorithms; medical computing; pattern classification; GA; IF-THEN rules; biologic relationship; cancer dataset NCI60; classifier; comprehensibility; disease; feature selection strategies; genetic algorithms; high dimensionality biological dataset; high level knowledge; high-level classification rules; hybrid approach; interpretability; prediction precision; predictive power; subset identification; Accuracy; Cancer; Educational institutions; Gene expression; Genetic algorithms; Sociology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2013 IEEE Congress on
Conference_Location :
Cancun
Print_ISBN :
978-1-4799-0453-2
Electronic_ISBN :
978-1-4799-0452-5
Type :
conf
DOI :
10.1109/CEC.2013.6557620
Filename :
6557620
Link To Document :
بازگشت