Title :
Interrelated two-way clustering: an unsupervised approach for gene expression data analysis
Author :
Tang, Chun ; Zhang, Li ; Zhang, Aidong ; Ramanathan, Murali
Author_Institution :
Dept. of Comput. Sci. & Eng., State Univ. of New York, Buffalo, NY, USA
Abstract :
DNA arrays can be used to measure the expression levels of thousands of genes simultaneously. Most research is focusing on interpretation of the meaning of the data. However, the majority of methods are supervised, with less attention having been paid to unsupervised approaches which are important when domain knowledge is incomplete or hard to obtain. In this paper we present a new framework for unsupervised analysis of gene expression data which applies an interrelated two-way clustering approach to the gene expression matrices. The goal of clustering is to find important gene patterns and perform cluster discovery on samples. The advantage of this approach is that we can dynamically use the relationships between the groups of genes and samples while iteratively clustering through both gene-dimension and sample-dimension. We illustrate the method on gene expression data from a study of multiple sclerosis patients. The experiments demonstrate the effectiveness of this approach
Keywords :
DNA; biology computing; data analysis; genetics; pattern clustering; DNA arrays; cluster discovery; domain knowledge; gene dimension; gene expression matrices; gene patterns; interrelated two-way clustering; multiple sclerosis patients; sample dimension; unsupervised gene expression data analysis; Colon; Computer science; DNA; Data analysis; Data engineering; Entropy; Gene expression; Information analysis; Pharmaceuticals; Supervised learning;
Conference_Titel :
Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-1423-5
DOI :
10.1109/BIBE.2001.974410