Title :
A Novel Clustering Approach Based on the Manifold Structure of Gene Expression Data
Author :
Shi, Jinlong ; Luo, Zhigang
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Clustering is an effective approach for the computational analysis of gene expression data. Various clustering algorithms have been developed to give reasonable interpretations of biological data and to discover biologically meaningful patterns of cellular function. Based on the manifold structure of gene expression data, analyzed under a geometric representation framework, a novel clustering approach is presented to reveal nonlinear expression patterns. The approach consists of three steps. First, a neighborhood graph is constructed over the gene expression points, from which the approximate geodesic distance between every pair of points is obtained. Second, the approximate geodesic distance, instead of the Euclidean distance, is used to measure the similarity between gene profiles. Finally, by defining the geodesic distance between a cluster and a gene expression point, new clusters are generated through an iterative process. Application of the approach to the yeast cell-cycle dataset validates its rationality and efficiency.
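The steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how the neighborhood graph is built, how the cluster-to-point geodesic distance is defined, or how the iteration is seeded. The sketch assumes a symmetric k-nearest-neighbour graph, Dijkstra shortest paths as the approximate geodesic distance, the mean geodesic distance to a cluster's members as the cluster-to-point distance, and user-supplied seed points. All function names (`knn_graph`, `geodesic_matrix`, `point_to_cluster`, `cluster`) are hypothetical.

```python
import heapq
import math


def knn_graph(points, k):
    """Symmetric k-nearest-neighbour graph with Euclidean edge weights (assumed construction)."""
    n = len(points)
    graph = {i: {} for i in range(n)}
    for i in range(n):
        nearest = sorted((math.dist(points[i], points[j]), j) for j in range(n) if j != i)
        for d, j in nearest[:k]:
            graph[i][j] = d
            graph[j][i] = d
    return graph


def geodesic_from(graph, source):
    """Dijkstra shortest-path lengths from source; unreachable nodes stay at infinity."""
    dist = {v: math.inf for v in graph}
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist[v]:
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def geodesic_matrix(graph):
    """Approximate geodesic distance between every pair of points."""
    n = len(graph)
    rows = []
    for s in range(n):
        d = geodesic_from(graph, s)
        rows.append([d[t] for t in range(n)])
    return rows


def point_to_cluster(geo, i, members):
    """ASSUMPTION: cluster-to-point distance is the mean geodesic distance to the members."""
    return sum(geo[i][j] for j in members) / len(members)


def cluster(geo, seeds, max_iter=50):
    """Reassign points to the geodesically nearest cluster until the labels stop changing."""
    n = len(geo)
    n_clusters = len(seeds)
    # Initial assignment: nearest seed point by geodesic distance.
    labels = [min(range(n_clusters), key=lambda c: geo[i][seeds[c]]) for i in range(n)]
    for _ in range(max_iter):
        groups = [[i for i in range(n) if labels[i] == c] for c in range(n_clusters)]
        new_labels = [
            min((c for c in range(n_clusters) if groups[c]),
                key=lambda c: point_to_cluster(geo, i, groups[c]))
            for i in range(n)
        ]
        if new_labels == labels:
            break
        labels = new_labels
    return labels


# Toy data (not gene expression data): two parallel line segments.
points = [(float(x), 0.0) for x in range(5)] + [(float(x), 10.0) for x in range(5)]
geo = geodesic_matrix(knn_graph(points, k=2))
labels = cluster(geo, seeds=[0, 5])
print(labels)
```

With k = 2 the two segments form separate graph components, so the geodesic distance between them is infinite in this sketch. A practical version would need a policy for disconnected neighborhood graphs, which the abstract does not address.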
Keywords :
bioinformatics; cellular biophysics; genetics; genomics; graphs; microorganisms; pattern clustering; approximate geodesic distance; clustering approach; gene expression; manifold structure; neighborhood graph; nonlinear expression patterns; yeast cell-cycle; Biology computing; Clustering algorithms; Data analysis; Data mining; Euclidean distance; Fungi; Gene expression; Humans; Nonlinear dynamical systems; Pattern analysis;
Conference_Title :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
DOI :
10.1109/ICBBE.2010.5516914