DocumentCode :
1309527
Title :
Finding Correlated Biclusters from Gene Expression Data
Author :
Yang, Wen-Hui ; Dai, Dao-Qing ; Yan, Hong
Author_Institution :
Dept. of Math., Sun Yat-Sen (Zhongshan) Univ., Guangzhou, China
Volume :
23
Issue :
4
fYear :
2011
fDate :
4/1/2011 12:00:00 AM
Firstpage :
568
Lastpage :
584
Abstract :
Extracting biologically relevant information from DNA microarrays is a very important task for drug development and test, function annotation, and cancer diagnosis. Various clustering methods have been proposed for the analysis of gene expression data, but when analyzing the large and heterogeneous collections of gene expression data, conventional clustering algorithms often cannot produce a satisfactory solution. Biclustering algorithm has been presented as an alternative approach to standard clustering techniques to identify local structures from gene expression data set. These patterns may provide clues about the main biological processes associated with different physiological states. In this paper, different from existing bicluster patterns, we first introduce a more general pattern: correlated bicluster, which has intuitive biological interpretation. Then, we propose a novel transform technique based on singular value decomposition so that identifying correlated-bicluster problem from gene expression matrix is transformed into two global clustering problems. The Mixed-Clustering algorithm and the Lift algorithm are devised to efficiently produce δ-corBiclusters. The biclusters obtained using our method from gene expression data sets of multiple human organs and the yeast Saccharomyces cerevisiae demonstrate clear biological meanings.
Keywords :
biology computing; genetics; pattern clustering; DNA microarrays; biclustering algorithm; clustering methods; correlated bicluster pattern; gene expression data; gene expression matrix; lift algorithm; mixed-clustering algorithm; Biclustering; biology computing.; data mining; gene expression data; pattern classification; singular-value decomposition;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2010.150
Filename :
5560654
Link To Document :
بازگشت