DocumentCode :
3068226
Title :
A MLP-SOM Combination to Select Relevant Genes from High-dimensional DNA Microarray Data
Author :
Chakraborty, Goutam ; Patra, Jagdish ; Noda, Seiryo ; Chakraborty, Basabi
Author_Institution :
Iwate Prefectural Univ., Iwate
fYear :
2007
fDate :
15-18 Dec. 2007
Firstpage :
999
Lastpage :
1004
Abstract :
More and more instruments for sophisticated biological measurements at micro-level are now available, leading to increasing quantities of gene expression data being collected. mRNA or cDNA expression levels for several thousands of genes are measured, but for practical reasons the number of data points (samples) are only in dozens. Some of the data are temporal, the change of gene-expression over a period of time. Other data sets are snap-shots at an instant of time. In this work, we consider such a non-temporal gene expression data to identify the few genes (out of thousands) which are sufficient for a target classification of the presence/absence of some disease. A general framework is proposed where, after eliminating genes having poor correlation with target classes, data dimension is further reduced with a two stage supervised and unsupervised artificial neural network classifier. The supervised classifier is a multilayer perceptron (MLP), whose input to hidden unit weight vectors are used as input to a clustering algorithm, here a self-organizing map (SOM). From the cluster centers we identify the responsible genes.
Keywords :
DNA; biology computing; feature extraction; genetics; learning (artificial intelligence); molecular biophysics; multilayer perceptrons; pattern classification; pattern clustering; self-organising feature maps; MLP-SOM combination; cDNA expression levels; clustering algorithm; data dimension; high-dimensional DNA microarray data; mRNA expression levels; multilayer perceptron; nontemporal gene expression data; relevant gene selection; self-organizing map; supervised artificial neural network classifier; two-step feature extraction method; unit weight vectors; unsupervised artificial neural network classifier; Artificial neural networks; Biomedical signal processing; Classification algorithms; Clustering algorithms; DNA; Diseases; Gene expression; Information technology; Software measurement; Space technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location :
Giza
Print_ISBN :
978-1-4244-1835-0
Electronic_ISBN :
978-1-4244-1835-0
Type :
conf
DOI :
10.1109/ISSPIT.2007.4458020
Filename :
4458020
Link To Document :
بازگشت