DocumentCode
1921759
Title
Genetic search for optimal ensemble of feature-classifier pairs in DNA gene expression profiles
Author
Park, Chanho ; Cho, Sung-Bae
Author_Institution
Dept. of Comput. Sci., Yonsei Univ., South Korea
Volume
3
fYear
2003
fDate
20-24 July 2003
Firstpage
1702
Abstract
Gene expression profile is numerical data of gene expression levels from organism, measured on the microarray. In general, each specific tissue indicates different expression level in related genes, so that it is possible to classify disease by gene expression profile. For classification, it is needed to select related genes called feature selection, because all the genes are not useful for classification. We propose GA-based method for searching optimal ensemble of feature-classifier pairs of gene expression profile in seven feature selection methods based on correlation, distance, and information theory, and representative six classifiers. Experimental results on two gene expression profiles related to cancers show that GA finds good solution quickly. Especially, in Lymphoma dataset, GA finds the ensemble of 100% accuracy.
Keywords
DNA; genetic algorithms; information theory; medical computing; pattern classification; search problems; DNA gene expression profiles; Lymphoma dataset; cancers; deoxyribonucleic acid; feature classifier pairs; feature selection; genetic algorithm; genetic search; information theory; microarray; optimal ensemble; Cancer; Computer science; DNA; Gene expression; Genetics; Information analysis; Monitoring; Sequences; Testing; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2003. Proceedings of the International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-7898-9
Type
conf
DOI
10.1109/IJCNN.2003.1223663
Filename
1223663
Link To Document