Title :
Analyzing the Escherichia coli gene expression data by a multilayer adjusted tree organizing map
Author :
Wei, Ning ; Gruenwald, Le ; Conway, Tyrrell
Author_Institution :
Sch. of Comput. Sci., Oklahoma Univ., USA
Abstract :
Using the DNA microarray technology, biologists have thousands of array data available. Discovering the function relations between genes and their involvements in biological processes depends on the ability to efficiently process and quantitatively analyze large amounts of array data. Clustering algorithms are among the popular tools that can be used to help biologists achieve their goals. Although some existing research projects employed clustering algorithms on biological data, none of them has examined the Escherichia coli (E. coli) gene expression data. This paper proposes a clustering algorithm called Multilayer Adjusted Tree Organizing Map (MA TOM) to analyze the E. coli gene expression data. In a semi-supervised manner, MATOM constructs a multilayer map, and at the same time, removes noise data in the previously trained maps in order to improve the training process. This paper then presents the clustering results produced by MATOM and other existing clustering algorithms using the E. coli gene expression data, and a new evaluation method to assess them. The results show that MATOM performs the best in terms of percentage of genes that are clustered correctly.
Keywords :
DNA; arrays; biological techniques; biology computing; genetics; microorganisms; noise; trees (mathematics); Escherichia coli gene expression data analysis; MATOM; correctly clustered genes percentage; evaluation method; functional genomics; multilayer adjusted tree organizing map; noise data removal; semi-supervised manner; Algorithm design and analysis; Bioinformatics; Biological processes; Clustering algorithms; Computer science; DNA; Gene expression; Genomics; Nonhomogeneous media; Organizing;
Conference_Titel :
Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
Print_ISBN :
0-7695-1907-5
DOI :
10.1109/BIBE.2003.1188965